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PROJECT  INTREX 


Activity  Report 


I.  INTRODUCTION 

Three  years  have  passed  since  the  plans  for  Project  Intrex  were  formulated  at 
Woods  Hole.  We  have  worked  in  the  directions  that  were  set  by  the  Planning  Conference, 
and  we  have  made  good  progress.  As  in  most  research  undertakings,  the  pace  has  been 
less  heroic  than  the  vision  of  the  planners.  We  had  hoped  to  achieve  the  major  goals  of 
the  plan  by  1968,  but  it  now  seems  unlikely  that  we  shall  reach  that  stage  before  1970. 

The  first  factor  in  modifying  the  dynamics  of  the  enterprise  was  the  decision  by 
M.I.T.  to  pursue  Project  Intrex  within  the  normal  academic  environment,  rather  than 
establish  a  special  activity  outside  the  regular  university  structure.  We  realized  that 
this  course  would  make  for  a  slower  start,  but  we  felt  that  the  growing  involvement  of 
faculty  and  students  along  with  the  research  staff  would  more  than  compensate  for  the 
initial  delay.  Our  observations  of  the  development  of  the  Intrex  team  in  the  Electronic 
Systems  Laboratory  have  reinforced  this  view. 

A  second  factor  in  limiting  the  scale  of  our  operation  has  been  the  difficulty  of 
attracting  financial  support.  The  funding  of  Project  Intrex  has  not  materialized  with  the 
speed  or  in  the  magnitude  of  our  expectations.  Where  we  have  hoped  fo>-  a  budget  of 
close  to  $3  million  for  the  next  two  years,  we  are  at  the  present  time  assured  of  no 
more  than  half  that  amount. 

In  spite  of  this  lowering  of  our  sights,  we  have  derived  some  encouragement 
from  our  negotiations  for  funds,  doth  our  principal  sponsors  appointed  strong  advisory 
committees  to  review  the  operations  of  Project  Intrex.  We  thus  had  an  opportunity  of 
testing  our  own  views  of  the  projec  t's  past  and  future  against  the  judgments  of  a  number 
of  highly  qualified  observers.  We  learned  a  great  deal  from  these  discussions,  and  we 
were  pleased,  of  course,  with  the  generally  favorable  reactions. 

The  activities  that  were  reviewed  by  these  committees  are  going  forward  m  two 
places  at  M .  I.  T.  In  the  Electronic  Systems  Laboratory,  under  the  direction  of  Pro¬ 
fessor  J.  F.  Remtjes,  a  team  of  librarians  and  engineers  is  engaged  in  a  twolold  attack 
on  the  access  problem.  For  bibliographic  access,  a  digitally-encoded,  machine- 
manipulated, "augmented''  catalog  is  cinder  investigation.  The  task  of  preparing  entries 
for  1 0 , 000  documents  is  about  half  completed.  A  user -oriented  program  for  consulting 
this  catalog  is  under  test,  A  display  console  is  in  the  final  stages  of  construction. 

For  full-text  access  to  the  document.*  that  have  been  cataloged,  an  experimental 
transmission  system  is  being  developed.  Microfiche  copies  of  the  documents  are 
scanned  and  transmitted  our  a  wide-band  circuit  to  a  remote  display  station  where  the 
text  is  reproduced  either  on  storage  cathode-ray  tube  or  on  a  duplicate  microfilm. 


The  report  which  follows  brings  the  story  of  these  experiments  up  to  date  and 
provides  a  detailed  technical  record. 

In  the  Engineering  Library.  Charles  Stevens,  of  the  Project  Intrex  staff,  and 
Rebecca  Taggart,  Librarian,  are  supervising  the  preparation  of  an  operational  environ¬ 
ment  in  which  regular  library  users  can  experiment  with  the  facilities  of  Project  Intrex. 

The  physical  reconstruction  and  expansion  of  this  oldest  unit  of  the  M-I.T.  lihrai; 
system  is  in  full  progress  and  :s  to  be  completed  by  next  spring. 

The  books,  journals,  and  other  materials  that  comprise  the  collection  of  this 
library  will  be  divided  into  an  active  collection  of  limited  size  and  a  storage  collection 
ot  less  f  requer.t  1  y -us  ed  materials.  The  cin.  tents  of  the  active  collection  will  be  con¬ 
stantly  changing  as  new  materials  are  added  and  older  materials  transterred  to  the 
storage  collection.  Access  to  the  high-priority  materials  in  the  active  collection  will 
thus  not  be  impeded  by  the  growing  number  of  obsolescent  documents.  Microfilm  and 
paper  duplication  will  be  used  extensively  to  supplement  direct  access  to  the  documents. 

These  procedures  in  combination  with  the  Intrex  facilities  will  provide  experience 
with  a  blend  of  traditional  and  advanced  experimental  operations  that  will  be  character¬ 
istic  of  university  '  braries  in  the  next  decade. 


Carl  F.  J.  Overhage 
Cambridge.  Massachusetts 
i5  September  1968 


II.  RESEARCH  AND  DEVELOPMENT  ACTIVITIES  (Electronic  Systems  Laboratory) 


A.  STATUS  OF  THE  PROGRAM 


Professor  J.  F.  Reintjes 

The  computer-stored  augmented-catalog  part  of  our  Experimental  Library  Storage 
and  Retrieval  System  is  now  working  satisfactorily  for  demonstration  purposes.  Catalog 
look-up  requests  can  be  made  by  specifying  subject  terms,  authors  and/or  titles.  In  addi¬ 
tion,  the  initial  group  of  documents  thus  retriev'd  may  be  narrowed  by  requiring  that 
specified  matches  be  made  on  additional  fields  of  the  catalog  records.  It  is  also  possible 
to  obtain  a  printout  of  the  information  contained  in  any  or  all  of  the  fields  associated 
with  each  catalog  record. 

Included  in  the  Demonst  ration  System  is  a  User's  Guide.  This  Guide,  which  pro¬ 
vides  detailed  instructions  on  methods  of  interrogating  the  catalog,  is  available  online  as 
computer -stored  information  or  offline  as  a  hard-copy  manual.  We  intend  to  determine 
receptivity  of  users  to  each  form.  The  Guide  has  been  prepared  with  an  awareness  of  the 
need  to  make  it  easy  for  new  users  to  become  acquainted  quickly  with  operating  details  of 
the  system. 

For  persons  who  are  already  familiar  with  the  System,  there  is  available  a  short 
form  of  the  dialog  for  making  requests  and  receiving  responses.  The  short  form  sub¬ 
stantially  reduces  the  time  consumed  in  inputting  and  outputting  information. 

The  present  System  is  designated  a  Demonstration  System  because  the  data  base 
contains  catalog  information  on  only  -18  documents.  Within  the  next  few  weeks,  however, 
a  t  plan  to  insert  at  least  1,000  catalog  records  into  the  computer.  We  should  then  be  in 
a  position  to  satisfy  the  requirements  of  persons  who  have  a  bona  fide  need  for  the  infor¬ 
mation  thus  stored.  Further  inc  reases  in  the  data  base  will  be  made  as  we  gain  experi¬ 
ence  with  the  formatting  proi  ess  associated  with  the  inputting  procedure. 

The  Demonst  ration  System  was  operated  successfully  during  July,  ldtH,  from  West 
Berlin  during  a  two-wee1  ("entermi  e  held  at  the  Technical  University  of  Berlin  on  the 
subject  The  Computer  and  the  University. 

The  remaining  two  fvstj  of  our  library  system,  the  augmented-catalog  console  and 
the  full  -  text  -a>c  ess  equipmei  '  are  nea  r  ing  c  omplet  ion  and  should  be  func  turning  early  in 
the  next  reporting  period.  Recent  emphasis  has  been  on  the  software  and  on  the  design 
and  c'onst  rue  t  ion  of  the  interface  and  control  logic  associated  with  these  items.  A  sub¬ 
stantial  amount  of  logic  Circuitry  is  required  brtwern  the  buffer  o ont rol  Ir  I  of  the  catalog 
console  and  the  Central  Time -Sharing  Computer  and  between  the  buffer/controller  anti 
the  text-access  equipment  in  order  to  carry  out  the  various  sequent  mg  operations  auto¬ 
matically.  The  logic  circuitry  is  now  being  tonst'uctrd  The  software  pai  kag.  for  the 
buffer  controller  has  been  completed  and  is  bring  debugged. 


To  date  approximately  5.000  journal  articles  have  been  photographed  and  placed  on 
microfiche.  We  are,  therefore,  prepared  to  make  available  the  full  text  of  the  documents 
being  cataloged  as  soon  as  the  text-access  equipment  becomes  operational. 
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D.  THE  COMPUTER-STORED  AUGMENTED-CATALOG  PROGRAM 

I.  AUGMENTED-CATALOG  INPUTTING 

Staff  Members 

Mr.  A.  R.  Benenfeld 
Mrs.  S.  K.  Escudier 
Mrs.  E.  J.  Gurley 
Mrs  .  S  .  V  .  Lage 
Mis 3  L.  T.  Lee 
Mrs.  J.  R.  Marshall 
Miss  V.  A.  Miethe 
MissS.  P.  Niessen 
Professor  J.  F.  Reintjes 
Miss  M.  E.  Sullivan 

Cataloger  Assistants 

Mrs.  A.  M.  Davis 
Mrs.  C.  U.  Koss 

Graduate  Students 

Mr.  N.  A.  Clark 
Mr.  M.  S.  Hsu 
Mr.  R.  Ray 

SUMMARY 

Literature  from  three  additional  research  areas  in  materials  science  and  engineer¬ 
ing  is  being  added  to  the  data  base.  Changes  have  been  made  in  the  specifications  for  two 
data  elements;  the  coding  notation  for  special  characters  has  been  expanded.  Modifica¬ 
tions  to  the  indexing  process  have  reduced  the  extent  of  word  redundancy  in  a  set  of  index 
terms.  A  study  of  the  current  economic  feasibility  of  online  teletypewriter-terminal  key¬ 
ing  with  respect  to  offline  paper-tape  keying  of  cataloging  data  shows  the  online  mode  to 
be  twice  as  expensive  as  the  paper-tape  mode;  the  online  mode  also  is  sensitive  to  the 
particular  computer  system  environment.  A  thesis  study  of  the  effects  of  indexer  experi¬ 
ence  over  time  on  the  subject-indexing  time  per  document  page  presents  evidence  of  an 
average  learning  period  of  three  months'  duration  for  librarians  and  two  months'  dura¬ 
tion  for  students.  Average  processing  times  for  current  operations  are  tabulated.  As  of 
September  1,  1968.  6928  documents  have  been  indexed. 

DATA  BASE  AND  LITERATURE  SELECTION 

Previously  the  literature  from  only  two  research  areas,  which  reflected  the  inter¬ 
ests  of  two  research  groups  in  materials  science  and  engineering,  had  been  selected  for 
the  augmented -catalog  data  base.  Literature  from  additional  research  areas  has  now 
been  added  but  on  the  basis  of  modified  criteria.  The  interests  of  the  two  initial  re¬ 
search  groups  are  being  considered  as  dual  cores.  Additional  research  groups 
must  overlap  in  their  interests  with  one  or  the  other  of  the  core  groups,  in  addition 
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Undergraduate  Students 
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to  meeting  other  criteria  for  the  selection  of  a  group  (see  Semiannual  Activity  Report 
for  16  March  f  967  )  .  In  this  manner  the  number  of  potential  system  users  is 

increased  and,  at  the  same  time,  the  adequacy  of  the  data  base  is  increased  without  un¬ 
duly  enlarging  that  base.  Three  additional  research  groups  have  been  selected  to  par¬ 
ticipate  in  the  catalog  experiments,  "he  research  groups  in  the  two  core  areas  are: 

Core  A.  1.  Rad  infrequency ,  microwave,  and  optical 
spectroscopy  of  liquids  and  sciids 

2.  Microwave  and  quantum  magnetics 

Core  B .  1  .  High-temperaturc  metallurgy 

2.  Casting  and  •■olidifii' ation 

3.  Structural  materials 

We  are  continuing  to  route  journal-issue  tables  of  contents  to  two  members  of  each 
reseaich  group;  in  return  for  this  current -awareness  service,  they  indicate  articles 
which  they  consider  important  to  their  area.  All  such  articles,  plus  any  obvious  over¬ 
sights  caught  by  a  librarian,  are  accepted  6  *  the  d  ta  base.  In  addition  to  the  journal 
literature,  the  data  base  includes  conference  proceedings  in  the  core -subject  areas,  and 
the  technical  reports  and  theses  issued  by  the  participating  research  groups.  When  a 
sufficient  data  base  of  this  prime  literature  has  been  established,  other  literature  forms 
will  be  added  to  round  out  the  catalog. 

PROCESSING 

No  significant  changes  have  been  made  in  the  workflow;  it  remains  essentially  as 
described  in  the  preceding  Semiannual  Activity  Report.  However,  the  addition  of  the 
literature  of  interest  to  three  new  research  groups  led  io  several  minor  changes  in  the 
prec  dures  associated  with  distribution  of  tables  of  contents  and  with  microfilm  initia¬ 
tion  . 

About  4500  documents  that  had  been  cataloged  before  the  tolerances  in  the  current 
microfilming  operation  were  established  have  now  been  filmed  for  microfiche  prepara¬ 
tion.  The  a.  ss  numbers  for  the  documents  stored  on  these  fiche  will  be  added  later  to 
the  machine -stored  catalog  records  through  an  update  operation. 

Some  documents  selected  for  (he  data  base  will  not  be  filmed  for  microfiche  text 
storage.  Such  documents  include  those  whose  length  exceeds  60  pages  (or  one  fiche, 
there  being  60  frames  per  fiche),  those  of  temporary  nature  (such  as  preprints),  and 
those  of  very  poor  graphic  quality.  However,  users  will  have  guaranteed  hard-copy 
text  access  to  there  documents. 
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As  of  September  1,  1968, 


6428  documents  have  been  indexed, 

6320  records  have  been  reviewed, 

5295  records  have  been  keyed, 

485  files  have  entered  the  computer  ar.d  codes 
converted  from  Ftexowr->er  to  ASCII  codes, 

454  files  have  passed  through  the  first 
proofreading  and  editing  process, 

212  files  have  passed  through  three  or  more 
proofreading  and  editing  processes, 

400  files  have  been  certified  for  further 
processing  into  the  stored  data  base. 


INDEXING 

The  student-indexer  program,  described  in  detail  in  the  preceding  Semiannual 
Activity  Report,  has  continued  to  be  quite  satisfactory.  Ten  students  were  employed  on 
a  part-time  basis  during  the  Spring  semester.  During  the  summer,  five  students  were 
employed;  two  of  these  students  worked  on  a  full -week  basis. 

The  costs  associated  with  computer  storage  of  data  are  high.  Temporary  storage 
for  processing  and  final-storage  requirements  are  particularly  large  for  the  subject- 
index  terms.  These  terms  are  stored  in  the  main  catalog  file  as  full  terms  and  in  the 
subject/title  inverted  file  both  as  stemmed -word  full  terms  and  as  phrase -decomposed 
and  stemmed  individual  words.  In  an  effort  to  reduce  the  processing  and  storage  costs 
without  sacrificing  the  basis  of  our  experiments,  we  have  more  closely  examined  the 
indexing  process  in  light  of  our  software-retrieval  capabilities.  As  a  result,  several 
modifications  have  been  made  to  the  indexing.  The  modifications  eliminate  excessive 
redundancy  within  a  set  of  index  terms;  at  t.he  same  time,  the  basic  design  of  the  index¬ 
ing  and  retrieval  processes  remains  unchanged.  It  may  be  helpful  to  review  the  nature 
of  the  indexing  before  discussing  the  specific  modifications  to  it. 

The  set  of  index  terms  describing  a  document  are  freely  chosen  combinations  of 
phrases  based  upon  the  text  of  a  document.  Each  term  is  structured  to  provide  sufficient 
contextual  expression  such  that  the  term  stands  by  itself.  This  is  often  achieved  by  in¬ 
tensifying  text  phrases.  Each  index  term  is  weighted.  As  explained  in  preceding  re¬ 
ports,  there  are  two  kinds  of  weights.  One  type  of  weight  reflects  the  relative  proportion 
of  a  document  devoted  to  a  discussion  of  the  represented  concept.  Here, a  weight  of  1 
signifies  that  the  term  is  descriptive  of  most  of  the  content  of  an  entire  document;  a 
weight  of  2  indicates  that  the  term  characterizes  a  section  of  the  document;  and  a 
weight  of  3  refers  to  a  term  that  describes  a  small  fraction  of  the  document.  The  other 
kind  of  weight  is  a  special-purpose  weight  assigned  irrespective  of  extent  of  discussion 
of  the  topic  designated  by  the  term.  The  special  weight  of  4  is  assigned  to  terms 


representing  mathematical  tools  ,  instrumental  tools  ,  materials,  or  applications  ; 
the  special  weight  of  0  (zero)  is  reserved  for  terms  that  are  generic  to  the  subject 
matter  of  a  document.  Previously,  an  index  term  might  have  been  eligible  to  receive 
two  weights,  a  weight  from  the  regular  series  indicating  the  extent  of  discussion,  and 
if  applicable,  a  special  series  weight.  Whenever  dual  weights  were  assigned,  the  en¬ 
tire  index  term  was  repeated. 

At  present,  the  software  search  and  relevancy  algorithms  do  not  utilize  the  as¬ 
signed  and  stored  weights,  and  so  it  is  redundant  to  assign  more  than  one  weight  to  a 
term.  In  the  future,  the  software  will  utilize  these  term  weights  in  the  determination 
of  the  relevancy  of  an  index  term  associated  with  a  particular  document  to  a  retrieval- 
request  term.  In  this  case,  assignment  of  more  than  one  weight  to  a  term  is  still 
redundant  because  only  one  of  the  weights,  the  higher  ordered  one,  will  be  considered 
in  determining  the  priority  of  the  term.  The  practice  of  assigning  two  weights  to  the 
same  term  has  been  dropped  from  the  indexing  process.  Because  the  regular  weights 
have  been  given  greater  emphasis  than  the  special  weights,  the  definition  of  the 
special-purpose  weights  has  been  revised  such  that  they  are  now  applied  only  to  terms 
which  are  not  eligible  for  a  regular  weight.  In  essence,  this  means  that  special- 
purpose  weights  are  now  assigned  only  to  terms  that  are  not  discussed,  per  se,  in  a 
document. 

In  addition,  the  phrase-decomposition  retrieval  programs  also  make  redundant 
any  index  term,  which  appears  as  a  string  within  any  other  index  term  that  has  the  same 
or  a  higher  ordered  weight.  Consequently,  for  example,  the  creation  of  an  index  term 
consisting  of  the  name  of  a  material  has  been  retained  only  if  that  name  does  not  also 
appear  within  another  term  with  a  higher  ordered  weight. 

These  two  modifications,  the  elimination  of  dual-weighted  terms  and  the  elimina¬ 
tion  of  terms  also  appearing  as  a  string  within  another  term,  do  cause  some  loss  of 
information.  A  user  will  not  be  able  to  narrow  his  subject-search  requests  solely  on 
the  basis  of  a  single  weight  without  losing  pertinent  references.  However,  other 
searching  strategies  can  overcome  chis  problem  because  the  words  themselves  have 
been  retained  in  the  index;  only  word  redundancies  have  been  eliminated. 

In  another  modification,  separate  terms  which  have  sinului  wording,  which  are 
logically  equivalent,  and  which  have  the  same  assigned  weight,  are  now  combined  into 
a  single  index  term.  This  single  term  has  compound -subject  or  object  sentence  ele¬ 
ments.  This  procedure  eliminates  repetition  of  words  common  to  the  separate  index 
terms.  For  example,  the  separate  terms  "optical-dispersion  curves  for  barium 
titanate  (3)"  and  "optical-dispersion  curves  for  strontium  titanate  (3)"  would  now  be 
combined  into  the  single  term  "optical-dispersion  cvirves  for  barium  titanate  and  stron¬ 
tium  titanate  (3). " 

With  the  elimination  of  these  redundancies,  the  indexing  and  the  software  which 
operates  on  it  are  in  better  harmony;  in  addition,  computer-storage  space  and  cost 
are  conserved. 


DATA  ELEMENTS  AND  CODING 

Additional  codes  have  been  established  for  Field  5,  Access  Number,  which  indi¬ 
cate  the  availability  and  locat.on  of  guaranteed  hard-copy  text  access  to  those  documents 
not  stored  on  microfiche. 

The  specifications  for  Field  66,  Level  of  Approach,  have  been  rewritten.  This 
field  suggests  the  professional  level  of  the  author's  intended  audience,  or  the  minimum 
general  level  of  understanding  required  to  comprehend  the  subject  o.  a  document.  The 
number  of  levels  has  been  reduced  from  seven  to  a  more  easily  identifiable  set  of  three, 
namely,  professional  level,  undergraduate  level,  and  below  college  level. 

The  coding  notation  for  simultaneous  machine-readability  and  human  recognition 
of  characters  not  on  our  Flexowriter  keyboard,  called  special  characters  (see  the  Semi¬ 
annual  Activity  Report  of  15  September  1967),  has  been  expanded.  The  string  of  char¬ 
acters  in  a  superscript  or  subscript  notation  is,  itself,  considered  a  special  character. 
Previously,  we  could  not  encode  such  a  string  if  it  contained  a  special  character  in  a 
nonleading  position.  Square  brackets  now  demarcate  nonleading  special  characters. 

F or  example, 

X  =  A(T-Tc)*' 

becomes 

*chi*  =  A(T  -  T*sub  c*)*sup  -[♦gamma*]* 

This  expanded  notation  also  permits  us  to  code  nested  special  characters  as,  for  ex¬ 
ample,  a  superscript  to  a  superscript. 

EVALUATIONS 

The  average  processing  times  for  current  workflow  operations  are  tabulated  in 
Fig.  B-l.  Other  characteristics  of  these  processes  and  of  the  catalog  records  are  also 
listed  in  Fig.  B-l.  It  takes  an  average  of  68.  Imam-minutes  to  catalog,  review,  key, 
and  first-proofread  a  catalog  record.  In  current  processing,  80  percent  of  the  files 
cycle  through  two  edit  loops  (once  for  editing  and  once  for  certification);  20  percent  of 
the  files  have  errors  to  be  corrected  in  a  second  online  editing  step  and  so  these  files 
are  cycled  through  three  edit  loops.  (A  file  consists  of  10  catalog  records.  )  About 
half  of  the  11.2  average  number  of  errors  per  file  detected  during  a  first  proofreading 
are  typographical  errors;  cataloging  errors  comprise  the  remainder.  As  the  number 
of  errors  per  file  decreases,  the  ratio  of  total  computer-time  consumed  to  console¬ 
time  consumed  in  online  editing  increases.  The  ratio  is  i/36  for  the  first  online  edit 
of  a  file  of  10  records  and  it  is  l/6  for  online  certification  of  an  error-free  file. 

We  can  gain  further  perspective  in  this  analysis  by  considering  the  sire  of  a 
logical  catalog  record  keyed  for  computer  input.  There  are  an  average  2461  keystrokes 
per  record,  exclusive  of  carriage  returns,  tabs,  and  case  shifts,  or  410  six-character 
Engl'sh  words  per  record.  There  are  16.4  index  terms  per  record,  averaging  7.7 


CURRENT  AVERAGE  PROCESSING  TIMES 
and 

ASSOCIATED  AVERAGE  CHARACTERISTICS 


Process 

Average  Time 

Indexing 

Descriptive  Cataloging  (by  Librarian) 

Review 

Keying  plus  Descriptive  Cataloging 
(by  Typist) 

(man- minutes/ record) 

28.  0 

4.  7 

10.  3 

lb.  9 

Proofreading  (by  Librarian) 

On-Line  Editing 

Second  Proofreading  (by  Typist) 

Second  On-Line  Editing  \  of  "the 

Third  Proofreading  (by  Typist)  j  files 
On-Line  Certification 

(man-minutes/file) 

82.  0 

20.  7 

11.5 

4.  1 

1.7 

1.  0 

Associated  Characteristics 

Average 

Computer  Time/Console  Time  Ratio 

First  On-Line  Edit 

0.028  (1/36) 

Second  On-Line  Edit 

0.074  (1/14) 

On-Line  Certification 

0.17  (l/o) 

Errors  Caught  During 

(p«' r  file) 

First  Proofreading 

11.2 

First  Editing 

0.  33 

Second  Proofreading 

2.  0 

(per  record) 

Keystrokes 

2461 

English  Words 

410 

Index  Terms 

16.  4 

English  Words/Index  Term 

7.  7 

(pages) 

Document  Length 

5.  3 

Fig.  B-1  Averoge  Processing  Time*  of  Current  Workflow  Operation!  and  Associated  Average 
Characteristic!  (January -April,  1968).  One  file  contain!  ten  records. 
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English  words  per  index  term  in  length.  At  present,  the  data  base  consists  almost 
entirely  of  journal  articles  or  conference  papers,  and  in  the  survey  sample  these  doc¬ 
uments  averaged  5.3  pages  in  length.  The  set  of  index  terms  comprises  about  l/ 3  the 
total  size  of  a  catalog  record,  an  abstract  or  excerpt  comprises  about  2/5  of  the  record 
size,  and  all  other  applicable  data  elements  account  for  the  remainder  of  the  catalog 
record. 

We  have  made  a  study  of  the  economic  feasibility  of  online  teletypewriter  - 
terminal  keying  with  respect  to  our  normal  offline  paper-tape  keying  of  the  cataloging 
data.  Six  files  of  catalog  records  were  keyed  online  by  a  typist  working  at  an  IBM  2741 
terminal  connected  to  the  7094  computer.  The  oni.ne  input  was  cost-compared  to  that 
for  another  six  files  input  by  the  paper-tape  medium.  The  analysis  of  each  input  mode 
included  all  of  the  computer  operations  through  the  generation  of  a  first  printout.  The 
data  are  summarized  in  Fig.  B-2;  they  are  valid  only  for  the  current  (March,  1968) 
M.I.T.  Compatible  Time-Sharing  System  (CTSS)  environment.  The  unit-cost  figures 
for  typist  salary  (including  overhead),  and  keying  machine  rental  costs,  are  pro-rated 
over  20  working  days  per  month,  seven  hours  per  day.  Materials  associated  with  on¬ 
line  input  are  negligible  in  cost  and  there  is  no  direct  charge  to  us  for  printout.  The 
analysis  shows  that  the  cost  of  online  input  of  data  to  the  augmented  catalog  in  the  cur¬ 
rent  CTSS  environment  is  $41. 07  per  file  of  ten  records  whereas  the  cost  of  paper-tape 
input  is  $23.46  per  file.  The  cost  differential  is  due  almost  entirely  to  the  computer 
time  and  cost  required  to  process  data  from  the  online  terminal  (205.4  seconds  versus 
6.4  seconds).  All  other  operations  in  the  two  input  modes  are  nearly  equivalent  in  time 
and  cost.  Two  computer  programs  (designated  EDA  and  QED)  were  used  in  this  experi¬ 
ment.  Each  program  operated  on  a  part  of  the  file  set  which  was  input  online.  There 
was  essentially  no  difference  in  the  total  computer  time  expended  with  each  program. 
The  ratio  of  computer  time  to  console  time  consumed  was  0.  018  for  EDA  and  0.  016 
for  QED.  However,  the  components  of  the  total  computer  time  are  program  sensitive. 
There  is  a  trade-off  between  proce:.sing  time  (the  time  for  programs  to  operate  on 
data  in  core)  and  swap  time  (the  time  to  move  programs  and  data  into  and  out  of  core 
in  a  multiprocessing  system).  The  ratio  of  proces.’.r.g  time  to  swap  time  is  0.43  for 
EDA  and  0.  27  for  QED. 

As  the  hardware  associated  with  the  Intrex  System  becomes  available,  and  as  the 
computing-system  environment  changes,  online  input  will  be  restudied;  particular 
attention  will  be  given  to  the  effects  of  using  a  local  buffer/controller  which  is,  in  turn, 
connected  to  the  larger  computer  system. 

In  another  study,  Mr.  Richard  Lufkin,  in  fulfillment  of  his  Bachelor  of  Science 
thesis  requirement  at  M.I.T.  ,  made  determinations  of  the  effects  of  indexer  experi¬ 
ence  over  time  on  the  subject-indexing  time  per  document  page.  The  data  are  scattered 
but  seem  to  indicate  an  average  learning  period  of  three  months'  duration  for  librarians 
and  of  two  months'  duration  for  students.  At  the  end  of  this  learning  period,  indexing 
time  per  document  (averaged  over  all  indexers)  levels  off  at  six  to  eight  .ninu  es  per 


COST  ANALYSIS  OF  ON  LINE  INPUT  AND 
PAPER  TAPE  INPUT 


On-Line  Input 

Paper  Tape  Input 

Ope  ration 
(Unit  Cost) 

Time  per 
File 

Cost  per 
File 

Time  per 
File 

Cost  pe  r 
File 

Keying  Operator 

(minute  s) 

(minute  s) 

Preparation 

27.0 

30.0 

Log  In 

2.4 

Input 

174.3 

164.2 

Total  Keying 

( $0 . 095/min . ) 

1 

203.7 

$19. 35 

194.2 

$18.45 

Compute  r 

1  ( seconds) 

( seconds) 

Log  In  Subtotal 

8.4 

Proce  s  sing 

7.1 

Swap 

1.3 

Input  Subtotal 

205.4 

6.4 

Proce  s  sing 

54.1 

3.4 

Swap 

151.3 

3.0 

Magnetic  Tape  Gen¬ 
eration,  Printout  and 
Miscellaneous  Oper¬ 
ations  Subtotal 

6.5 

8.0 

Processing 

5.6 

6.8 

Swap 

0.9 

1.2 

Total  Computer 

($0, 083/sec. ) 

220.3 

$18.28 

14.4 

$1.20 

Computer  Operator 

( $0 . 055/min . ) 

(minute  s) 
3,2 

$0.18 

(minute  s) 
7.0 

$0.39 

Keying  Machine  Rental 

(minute  s) 

(minute  s) 

IBM  2741  and  associ¬ 
ated  equipment 

($0.01 6/min . ) 

203.7 

$3.26 

Fie xow rite  r  2 30  3 

($0.01  4/min . ) 

194.2 

$2.72 

Mate  rials 

$0.70 

TOTAL  COST 

$41.07 

$23.46 

Fig.  6-2  Co«t  Anolyiii  of  On-Lin#  Input  ond  Paper  Tape  Input  of  Catalog  Data  to  a  7094  Computer 
Operating  in  the  M.I.T.  CTSS  Environment  (Morch,  1968).  One  file  contain*  ten  record*. 
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page  from  an  initial  high  of  ten  to  twelve  minutes  per  page.  In  another  phase  of  this 
thesis,  the  average  indexing  time  per  page  was  found  to  be  influenced  by  document- 
related  parameters.  There  were  observable  differentials  of  the  per-page  indexing 
time  among  documents  grouped  by  format.  For  example,  the  ease  of  indexing  articles 
decreases  in  this  order:  conference  paper,  regular  journal  article,  letter-type 
journal  article. 


2.  STORAGE  AND  RETRIEVAL 

Staff  Members 

Mr.  C.  E.  Hurlburt 

Mr.  P.  Kugel 

Mr.  R.  L.  Kusik 

Mr.  R.  S.  Marcus 

Mr.  M.  K.  Molnar 

Professor  J.  F.  Reintjes 

Professor  A.  K.  Susskind 

Mr.  ?T.  F.  Vandevenne  (visiting) 


SUMMARY 

As  described  in  preceding  Semiannual  Activity  Reports,  the  implementation  of 
our  experimental  augmented -catalog  storage-and-retrieval  computer  programs  has 
been  scheduled  in  a  research  program  of  three  phases: 

Phase  I:  A  restricted  system  for  use  by  the  Intrex  program¬ 

mers  in  investigating  various  tec  hniques  of  file 
organization,  storage,  and  retrieval. 

Phase  II:  A  more  complete  system,  based  on  the  present 

M.  I.  T.  -modified  IBM-7094  time-sharing  comput¬ 
ing  system  (CTSS),  for  conducting  experiments  on 
storage  and  retrieval  in  the  context  of  our  10,000- 
document  augmented  catalog  and  a  selected  group 
of  users . 

Phase  III:  An  expanded  version  of  the  Phase-II  system  to  be 

implemented  on  next-generation  time-sharing 
computers  at  M.I.T. 

The  Phase-I  activity  has  been  completed.  To  date,  very  little  Phase-UI  activity 
has  been  undertaken  since  the  anticipated  new  time-sharing  systems  will  not  be  avail¬ 
able  to  Intrex  as  soon  as  expected.  Therefore,  Intrex  experiments  will  be  conducted 
for  the  time  being  on  the  present  M.I.T.  IBM-7094  time-sharing  system  with  the  Phase 
II  system. 

The  Phase-II  system  has  undergone  rapid  development  in  this  reporting  period 
and  we  now  have  a  working  version  which  has  been  termed  the  'Demonstration  System". 
This  Denv  itration  System  includes  most  of  the  basic  retrieval  functions  and 


user-system  dialog  features  that  have  been  specified  in  previous  Semiannual  reports 
for  our  initial  Phase-U  system,  but  the  data  base  upon  which  it  operates  is  small-- 
48  documents.  This  system  has  not  only  permitted  us  to  demonstrate  lntrex  --  demon¬ 
strations  have  been  given  via  telephone-communication  links  at  a  recent  conference  in 
West  Berlin --but  also  to  experiment  with  actual  user  reaction  to  system  operation. 

Our  next  major  goal  is  to  expand  the  online  data  base  to  sufficient  size  (perhaps 
1000  documents)  so  that  meaningful  experiments  can  start  with  users  who  have  a  real 
need  to  extract  information  from  the  system.  At  the  same  time  we  shall  be  evaluating 
and  improving  system  performance  and  preparing  for  the  incorporation  of  the  display- 
console  and  text-access  systems. 

In  addition  to  the  above  work,  which  is  intended  as  a  basis  for  experiments  in  the 
near  future,  a  study  of  a  storage  system  for  a  much  larger  catalog  is  under  way. 

DESCRIPTION  OF  DEMONSTRATION  SYSTEM 

The  Demonstration  System  permits  the  user  to  search  the  data  base  for  references 
to  documents  by  specifying  subjects,  authors  and/or  titles.  The  user  may  then  make  a 
selection  among  the  documents  thus  retrieved  by  requesting  that  they  contain  specified 
information  in  certain  fields  of  the  documents'  catalog  records.  Finally,  he  may  re¬ 
quest  that  the  information  contained  in  any  or  all  of  the  catalog  fields  be  printed  out. 

In  order  to  illustrate  the  nature  of  the  Demonstration  System  more  concretely,  a 
sample  user-system  dialog  is  given  in  Fig.  B-3.  This  dialog  is  reproduced  from  the 
typewritten  copy  prepared  on  an  IBM  2741  teletypewriter  console  attached  to  the  time¬ 
sharing  system  (CTSS).  For  illustrative  purpose  of  this  report  each  user  statement 
is  flagged  by  a  number  and  the  letter  JJ  in  the  left  margin.  Similarly,  system  mes¬ 
sages  are  flagged  by  numbers  and  the  letter  Sj  . 

In  his  first  two  statements  the  user  has  logged-in  to  the  time-sharing  system 
(CTSS)  which  hosts  lntrex  as  well  as  many  other  computer  programs.  Note  that  the 
user's  second  statement,  his  password  to  CTSS  is  not  printed  because  the  274  1  is 
changed  to  nonprinting  mode  by  CTSS  for  that  statement  to  protect  the  security  of  the 
password.  This  log-in  procedure  will  be  unnei  essary  for  individual  users  for  consoles 
dedicated  to  lntrex  use. 

The  third  user  statement  "resumes  ”  the  "lntrex"'  system  (at  this  point  the  user 
could  have  called  for  any  other  program  currently  resident  in  CTSS)  and  in  the  fourth 
statement  the  user  "signs  in”  to  lntrex  by  typing  his  name  and  address.  The  "sign-in" 
statement,  in  conjunction  with  the  monitoring  procedures  (see  below),  provides  us,  as 
system  analysts,  with  a  means  for  keeping  track  of  system  use.  It  also  serves  to  in- 
tri  'uce  the  user  to  certain  system  procedures.  For  example,  the  user  is  apprised  of 
the  fact  that  his  statement  must  be  terminated  with  a  carriage  return  (note  that  a  more 
natural  statement  -terminator  button  or  switch  will  be  possible  with  the  display  cons- 
sole  --  see  Section  B.  3).  While  at  present  the  sign-in  statement  merely  serves  to  aid 
in  monitoring  system  use.  we  anticipate  future  system  developments  whereby  some 
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Fig.  B-3  Sample  Demonstration  System  Dialog 


IS  TST7X5 :  USERS-  16,  MAX-  47. 

READY . 

W  login  m5806  mercus 
2S  W  1J1S. 7 
Password 
2U 

3S  STANDBY  LINE  HAS  BEEN  ASSIGNED 

M5806  160  LOGGED  IN  08/19/68  1315,9  FROM  800277 

LAST  LOGOUT  WAS  08/15/68  944.4  FROM  800277 

HOME  FILE  OIRECTORY  IS  M5806  CMFL01 

DUE  TO  HARDWARE  DIFFICULTIES,  CTSS  OPERATION  MAY  BE 
IRREGULAR. 

CTSS  BEING  USED  IS  TST7X5 
R  6.166*1.016 

30  resume  Intrex 
4S  W  1316.8 

Greetings!  This  Is  Intrex  la.  Please  sign  In  hy  typing  your  name  and 
address  as  In  the  following  example: 

smith,  r  j/mlt  13-5251 

Note  that  your  slgn-ln  statement  should  end  with  a  carriage  return. 

READY 

4U  marcus,  r  s/mlt  35-406 

5S  If  you  already  know  how  to  use  Intrex,  you  may  go  ahead  and  type 
In  commands.  (Remember,  each  command  ends  In  a  carriage  return.) 

Otherwise,  for  Information  on  how  to  make  simple  queries  of  the 
catalog,  type 

Info  2 

or,  to  see  the  Table  of  Contents  (Part  1)  of  lntrex-l  Guide  which  will  t 

direct  you  to  other  parts  of  the  Guide  explaining  how  to  make  more  detailed  j 

queries,  type  j 

t 

Info  1 
READY 

50  Info  2  , 

6S  Part  2  of  Intrex  1A  Guide:  Simple  Queries 


To  find  documents  In  the  system  specify  your  query  hy  subject,  author, 
and/or  title  terms,  as  shown  In  the  3  examples  below: 

subject  ferroelectric  transitions 

authc  '  Mess,  G.B. /subject  helium 

title  sul fur  I  tat  lon/author  Swisher 

In  order  to  specify  additional  restrictions  (e.g.,  where  author  comes 
from,  journal,  word  variations  not  to  use,  etc.),  see  Part  9  of  the  Guide 
(or  type  Info  9).  For  other  than  standard  output  (document  numbers,  titles, 
and  authors)  see  Part  I.  For  general  Intrex  command  format  and  abbreviations 
see  Parts  6,1  and  6,2, 

To  see  Table  of  Contents  for  Intrex  la  Guide  and  how  to  use 
the  Guide  on  line,  type 
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Fig.  B-3  (cont'd) 

Info  1 

Otherwise,  you  nay  now  make  slmnte  Queries  or  use  any  other  command. 

READY 

6U  subject  solid  phase  transitions 

7S  A  search  on  your  query  SUBJECT  solid  phas-e  transit-ions  found  7  documents. 
To  output  the  catalog  fields  DOCUMENT  NUMBER,  TITLE,  AUTHOR  on  those 
documents  type 


£0 

This  output  will  take  about  IS  seconds  per  document.  You  may  terminate 
this  output  at  any  time  by  hitting  the  ATTN  key  ONCE.  Otherwise,  you  may 
change  your  output  request.  For  Information  see  Part  8  of  Guide  or  type 

Info  8 

or  change  your  field  restriction  (s r»  Part  9.5)  or  make  another  request  of 
Intrex  (see  Part  1) 

READY 
?U  go 

ffS  1.  DOCUMENT  NUMBER  2851 

(21'  AUTHOR 

Hoshino,  Sadao; 

Sh Imaoka ,  Koh J I  ( J  A  ) ; 

N I Imura,  Nobuo  (JA) 

( 24  )  TITLE 

Fer roe  I ec t r I c 1 1 y  In  solid  hydrogen  halides 

2.  DOCUMFNT  NUMBER  5450 

(21)  AUTHOR 

Slhvonen,  Y.T. 

(24)  TITLE 

Photo  I un I ne scence ,  photocurrent,  and  phase- t rans ! t I  on  correlations 

5.  DOCUMENT  NUMBER  5174 

(21)  AUTHOR 

Stanley,  H,  Eugene  (TA) 

(25)  CORPORATE 

Harvard  University.  ' Camh r I dre \ ,  'Mass.<  (RT) 

(24)  TITLE 

H I gh- temper  a t u re  expansions  for  the  classical  Heisenberg  'model, 
suscen t I h ’ I  I t y 

4.  DOCUMENT  NUMBER  5174 

Cl)  AUTHOR 

Sc Nm I d t ,  Hart  land  H.; 

Ondycke,  Jack  (TA)  (JA); 

C»u,  Char  I es  E .  ( JA ) 

(25)  CORPORATE 

University  of  >Ca 1  I f orn I  a < ,  Rlvers!de<  (BT) 

(24)  TITLE 

Heat  capacity  in  the  critical  region  o'  xenon 

5.  DOCUMENT  NUMBER  5041 
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Fig.  8-3  (  cont'd  ) 


(21)  AUTHOR 

Nov  I kova ,  S  .  I  . ; 

Shell mova ,  L .  E  .  ( JA ) 

(24)  TITLE  | 

Low- 1 empe ra tu re  phase  transition  in  tin  teliurlde  * 

i 

6.  DOCUMENT  NUMBER  1715  l 

(21)  AUTHOR 

Ne  C 1 1 e ton ,  R .  F  . 

(24)  TITLE 

Renormalization  and  the  YIrst-order  fe r roe  1 ec t r I c  transition  In 

7.  DOCUMENT  NUMBER  1690 

(21)  AUTHOR 

W I  I  I ens ,  R.H.; 

Buehler,  E.  (JA); 

Matthias,  B.T.  (JA) 

(24)  TITLE 

Superconductivity  of  the  t rans 1 1 1 on-me t a  I  carbides 

Output  completed.  Total  of  7  documents  found.  You  may  now  s*e  j 

additional  output  on  these  documents  by  making  a  new  'output'  recjuest  (for 

Information  on  how  to  do  this,  see  Part  8  of  the  gu 1 de  or  type  Info  8).  j 

You  nay  also  select  a  portion  of  these  documents  by  making  a  new  'Infield'  j 

request  (see  Part  9.5).  Otherwise,  you  may  make  a  new  search  (see  Part 

2)  or  make  other  requests  (see  Part  1).  j 

READY  i 

8U  output  affiliation  matchsub  relevance/go 
9S  1.  DOCUMENT  NUMBER  285  1  ;  RE  LF VANCE  5/5 

(22)  AFFILIATION 

University  of  >Tokyo<.  Institute  for  Solid  State  Physics; 

University  of  ’’Tokyot.  Institute  for  Solid  State  Physics; 

University  of  >Tokyo<.  Institute  for  So'ld  State  Physics 

(74)  MATCHSUB 

nhase  transition  at  low  temperature  In  solid  hydrogen  halides  (0) 

2.  DOCUMENT  NUMBFR  5450;  RELEVANCE  2/5 

(22)  AFFILIATION 

>Te*as<  Instruments,  ,>Callas< 

(74)  MATCHSUB 

(TITLE) 

5.  DOCUMENT  NUMBER  5174;  RELEVANCE  2/5 
(22)  AFFILIATION 

t  yman  Laboratory  of  Physics  Harvard  University,  >C»mbrldgef,  > 

Laboratory  M.I.T.,  s Le* Ingtont ,  >Mass.< 

(24)  MATCHSUB  >; 

relation  of  h I gb- tempe ra t ure  espenslons  for  some  two-d I  mens  I onal 
nearest-neighbor  fer romagne t I c  Interactions  to  phase 

(T»sub  c*  ••/•  0)  (5);  | 

4.  DOCUMENT  NUMBER  5124;  RELEVANCE  2/5  J 

I 

k 


% 


University  of  ->Cal  lfornta<,  >Rlverslde<.  Dept,  of  Chemistry; 
University  of  >Ca1 I fornla<,  >Rlverstde  .  Dept,  of  Chemlstry/Unlv 
>San  01ego<.  Dept,  of  Chemistry; 

:Natlonal  Science  Foundation  Undergraduate  Research  Part !c I  pane :U 
>Cal ! forn I a<,  >ktverslde<.  Dept,  of  Chemistry 

( 74  )  MATCH5UB 

phase-trans I t ton  temperature  of  xenon  (3); 

5.  DOCUMENT  NUMBER  304";  RELEVANCE  2/3,  2/3 

(22)  AFFILIATION 

Balitov  (A. A.)  Institute  of  Metallurgy  Academy  of  Science  of  the 
Baikov  (A. A.)  Institute  of  Metallurgy  Academy  of  Science  of  the 

(74)  MATCHSUB 

low-temperature  phase  transition  In  tin  te'llurlde  (1); 

(TITLE) 

6.  DOCUMENT  N'JM8ER  1715;  RELEVANCE  2/3,  2/3 
(22)  AFFILIATION 

Sand  is  Laboratory,  >Albuauerque<,  >N.M.< 

(74)  MATCHSUB 

second-order  phase  transition  In  perovskltcs  (3); 
first-order  phase  transition  (0); 

7.  DOCUMENT  NUMBER  1690;  RELEVANCE  2/3,  2/3 

(22)  AFFILIATION 

Bell  Telephone  Laboratories,  >Hurray  HUK,  >N,J.<; 

Bell  Telephone  Laboratories,  >Murray  FiMK,  >N.J.<; 

Bell  Telephone  Laboratories,  >Murray  Hlll<,  >N.J.< 

(74)  MATCHSUB 

superconduct  I ng  transition  temperature  as  a  function  of  composlt! 
solution  (3); 

superconducting  transition  temperature  as  a  function  of  composltl 
solution  (3), 

Output  completed.  Total  of  7  documents  Found.  You  may  now  see 
additional  output  on  these  documents  by  making  a  new  ’output'  request  (for 
Information  on  how  to  do  this,  see  Part  8  of  the  guide  or  type  Info  8). 

You  may  also  select  a  portion  of  these  documents  by  making  a  new  'Inf laid 
request  'see  Part  9.3'.  Otherwise,  you  may  make  a  new  search  (see  Part 

2)  or  make  other  requests  (see  Part  1). 

READY 

9U  Infield  affiliation  harvard/o  71/go 
105  1.  DOCUMENT  DUMBER  3174 

(71)  ABSTRACT 

The  hlgh-temperature  series  expansion  of  the  zero-field  magnetic 
•ch I */*chl*«sub  Curie*  *  1  ♦  *S!GMA**sub  1  *  l*sup  *  Inf  I n I ty**a*S 
Is  related  to  the  diagrammatic  representation  of  the  corresponds 
expansion  of  the  zero-field  static  spin  correlation  function 
•*S»suh  f*.S*sub  g****sub  ‘beta**  presented  elsewhere.  The  first 
general  lattice  constants"  p*sub  lx*.  The  general  lattice  expressions  are 
used  to  evaluate  these  a*sub  I*  numerically  for  three  two- 1 1  NT.  1 
Cues  the  crIREAOY 
iOU  quit 

IlS  Thank  you  for  using  Intrex, 
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history  is  kept  to  past  users  so  that,  when  someone  signs  in,  the  system  can  take  ac¬ 
count  of  his  previous  experience  to  help  direct  him  in  current  system  use. 

Intrex  response  to  the  user's  sign-in  statement,  message  5S,  is  illustrative  of 
several  features  of  the  prompting  and  instructional  techniques  employed  by  the  system. 
In  the  first  place,  the  user  is  told  of  the  various  alternative  actions  that  he  may  take  at 
any  given  time.  Secondly,  the  specific  form  of  the  statement  he  should  type  to  invoke 
one  of  these  actions  is  explicitly  stated,  where  possible.  Thirdly,  where  it  is  not  pos¬ 
sible  to  explain  the  alternatives  completely,  the  user  is  referred  to  a  Guide  to  system 
use  for  further  details. 

The  Guide  is  available  both  in  hard-copy  form  and  online.  Selected  pages  from 
the  haru-c.opy  version  are  shewn  in  Fig.  B-4.  The  user  may  request  that  a  section  of 
the  Guide  be  printed  online  by  using  the  "info"  command  (see  user  statement  ?U  and 
system  response  6S).  It  may  be  seen  that  the  Guide  itself  also  attempts  to  use  the 
techniques  of  presentation  of  alternatives,  example,  and  reference  to  mo:-e  detailed 
information.  The  sections  of  the  Guide  are  sized  for  convenient  printing  and  viewing 
online.  Details  of  the  generation  and  maintenance  of  the  Guide  are  given  below  in  the 
discussion  on  system  implementation. 

The  user's  sixth  statement  initiates  a  search  in  the  inverted  files  for  documents 
on  a  given  subject.  Searching  may  also  be  done  on  title  or  author  terms  or  combina¬ 
tions  of  subject,  title  and/or  author  terms.  It  may  be  noted  that  the  form  of  the  user's 
statements  is  a  compromise  between  the  precise  but  esoteric  and  complicated  form  of 
many  programming  languages  and  the  familiar  but  ambiguous  (and,  therefore,  difficult 
to  interpret  automatically)  form  of  natural  English.  Command  and  argument  names 
(see  Part  6.Z  of  Guide  in  Fig.  B-4)  are  simple  and  mnemonic.  Format  is  kept  simple 
with  only  three  basic  delimiters  required:  spaces  to  separate  arguments  from  each 
other  and  from  command  names;  slashes  to  separate  commands;  and  a  carriage  return 
to  terminate  the  statement. 

In  response  to  the  user's  search  request  Intrex  replies  with  a  message  (7S)  illus¬ 
trative  of  several  other  features  of  system  dialog.  In  the  first  place,  the  system  plays 
back  its  understanding  of  the  user's  statement.  Also  the  system  indicates,  by  hyphen¬ 
ating  word  endings,  how  it  has  stemmed  the  words  in  the  user's  search  specification. 
This  is  important  because  Intrex  matches  these  word  stems  to  word  stems  in  the  in¬ 
verted  file  (see  preceding  Semiannual  Activity  Report).  As  a  further  indication  of  the 
progress  of  the  retrieval  process,  the  number  of  matching  documents  found  in  the  in¬ 
verted  files  is  printed.  Since  the  user  made  no  special  output  request  in  statement  6U, 
Intrex  reports  the  estimated  time  to  output  the  standard  catalog  fields.  This  system 
message,  then,  gives  feedback  which  may  interact  with  the  user's  original  intentions 
and  expectations  and  allow  him  to  redirect  his  search. 

The  points,  at  which  the  system  reports  to  the  user,  have  been  chosen  in  light  of 
the  operating  characteristics  of  the  host  CTSS  time -sharing  system,  The  intention  is 
to  report  back  soor.  enough  so  that  the  user  experiences  quick  response  but  not  to 


Fig.  B-4  Sections  from  Intrex  Guide 


Part  1  of  intrex  1A  Guide:  Table  of  Contents 


To  have  a  part  of  the  Guide  printed  out  on  line  use  the 
"Info"  command.  For  example,  for  Information  on  making 
simple  queries  (l.e.,  to  print  out  part  2),  type 

I  nfo  2 

PART  CONTENTS 

1  Table  of  Contents 

2  Simple  Queries 

J  General  Remarks  -  How  to  Get  Guide  (printed  copy) 

4  l.og-in  to  CTSS  and  Call  Intrex 

3  Typing  Errors  -  How  to  Correct 

6.1  Commands,  Modes  (LONG,  SHORT),  Time  Checks 

6.2  Command  Names  and  Abbreviations 

7  Preliminary  Output 

8  Final  Output 

9  Generalized  Queries 

10  Scanning  Index  Terms 

11  Interrupting  System  Messages 

12.1  Text  Access 

12.2  Library  Serv  :es 

13  User  Comments  and  Questions 

14  Documents  In  the  Collection 

15  The  Catalog  and  Its  Fields 

16  Sample  Catalog  Record 

17  Exit  from  the  System 

This  online  guide  was  last  revised  on  7/24/68. 
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Part  2  of  Intrex  1A  Guide:  Simple  Queries 


To  find  documents  In  the  system  specify  your  query  by 
subject,  author,  and/or  title  terms,  as  shown  In  the  3 
examples  below: 

subject  ferroelectric  transitions 
author  Hess,  G.B. /subject  helium 
title  sul fur Izat lon/author  Swisher 

In  order  to  specify  additional  restrictions  (e.g., 
where  author  comes  from,  journal,  word  variations  not  to 
use,  etc.),  see  Part  9  of  the  Guide  (or  type  Info  9).  For 
other  than  standard  output  (document  numbers,  titles,  and 
authors)  see  Part  8.  For  general  Intrex  command  format  and 
abbreviations  see  Parts  6.1  and  6.2. 
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Part  15.73:  SUBJECTS 


Terms  describing  the  subject  content  of  the  document 
are  entered  here.  A  term  may  be  a  single  word  or  a  phrase. 
Terms  are  commonly  phrases  used  by  the  author.  Eash  term  Is 
assigned  a  'range'  as  described  In  Part  9.2.  Normally, 
subject  specification  Is  made  In  the  primary  part  of  the 
query  as  described  In  Part  9. 

Journal  articles  are  Indexed  by  an  average  of  about  20 
terms.  Each  term  has  an  average  of  about  5  words. 


Part  15.74:  Matching  Subject  Terms  (MATCHSUB) 


This  Is  not  an  actual  field  In  the  catalog  but  calling 
for  It  as  output  (see  Part  8)  will  cause  the  subject  terms 
that  matched  the  user's  query  to  be  Included  In  the  output. 
(See  Part  15,75  for  criterion  for  matching;  .e., 
RELEVANCE. ) 

If  the  matching  term  Is  the  title  the  word  "(TITLE)"  Is 
output.  Otherwise,  this  field  Is  simply  a  subset  of  field 
73  (SUBJECTS). 

Since  this  Is  not  an  actual  field,  an  INFIELD  request 
would  be  Inappropr late. 
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Part  15.75:  RELEVANCE 


This  Is  not  an  actual  field  In  the  catalog  but  calling 
for  It  as  ouptut  (see  Part  8)  will  cause  the  estimated 
relevance  of  each  matching  subject  term  (see  Part  15.74)  to 
be  Included  In  the  output. 

At  present,  relevance  Is  estimated  as  the  ratio  of  the 
number  of  Individual  English  words  in  the  nwr  subject 
search  request  that  actually  match  (l.e.,  .  ~>n  word 
stems)  for  a  given  subject  term  In  the  catalog  ie  total 
number  of  words  In  the  subject  search  request. 

The  present  criterion  for  matching  depends  on  the 
number  of  words  In  the  subject  search  request.  If  there  are 
one  or  two  words,  each  must  match  If  there  are  three  or 
four  words,  no  more  than  one  can  fall  to  .<iatch  (l.e.,  at 
least  2/3  or  3/4).  If  there  are  more  than  four  words  In  the 
subject  search  request,  at  least  three  of  them  must  match 
words  In  the  given  subject  term. 


Fig.  B-4  (cont'd) 

Part  3  of  Intrex  1A  Guide:  General  Remarks 


Intrex  1A  Is  an  Initial  demonstration  version  of  a 
system  for  automatic  retrieval  of  Information  from 
libraries.  For  this  demonstration  the  data  base  contains 
Information  on  48  journal  articles  In  selected  areas  of 
Materials  Science  and  Engineering  (Including  spectroscopy  of 
liquids  and  solids  and  high  temperature  metallurgy).  The 
data  base  Includes  all  the  Information  usually  found  In  card 
catalogs  plus  much  additional  Information  like  abstracts, 
level  of  approach,  and  author's  purpose.  For  details  on  the 
data  base  see  Parts  14,  15,  and  16  of  this  Guide.  You  may 
Interrogate  this  data  base  to  find  references  to  documents 
covering  broad  or  specific  questions  of  Interest. 

How  to  use  Intrex  1A  Is  explained  In  "Intrex  1A  Guide" 
for  which  this  Is  the  Introduction.  Contact  Project  Intrex 
at  M.l.T.  Room  35-416,  ext.  2354,  to  obtain  copies  of  this 
guide  or  other  Information  about  Intrex.  This  guide  Is  also 
available  online  at  the  computer  as  explained  In  Part  1. 

References  on  Project  Intrex  Include  the  Semiannual 
Activity  Reports  (Call  No.  Z699,  A1.M41.P96)  and  the  Report 
of  the  Planning  Conference  1965  (Call  No.  Z699 . A1 . P712 ) . 
They  are  available  at  the  Engineering  Library 
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Part  6.2:  List  of  Commands 


A  list  of  commands, 
abbreviations)  and  Part  of  Guide 
below: 

ABBREVIATION 


NAME 

Info 

subject 

author 

title 

time 

on 

off 

long 

short 


I 

sh 


PART 

1 

2 
2 
2 

5.1 

6.1 
6.1 
6.1 
6.1 


special  arguments 
In  which  explained 


I  s 


go  g  7 

seematch  sm  7 

output  o  8 

standard  ♦  8 

all  8 

range  r  9.2 

not  9.2 

major  1  9.2 

secondary  2  9.2 

minor  3  9.2 

tool  4  9.2 

generic  0  9.2 

Infield  In  9.4 

scan  10 

library  lb  12,2 

comment  C  13 

exit  ex  17 

quit  q  17 


(and 

given 


-22- 


report  so  often  that  the  user  is  forced  into  unnecessary  additional  responses  of  his  own, 
thereby  forcing  up  not  only  the  overall  real  time  but  also  the  "swap"  time  in  the  computer 
which  has  to  read  back  the  intrex  programs  on  each  interaction.  Of  course,  the  incor¬ 
poration  of  the  buffer  controller  computer  for  the  Intrex  display  consoles  will  improve 
the  operating  characteristics  of  the  time -sharing  environment  and  may  allow  frequent 
feedback  with  less  cost  at  the  central-computer  level. 

In  our  sample  dialog  the  user  takes  the  first  alternative  (statement  7U)  and  the 
system  responds  with  the  standard  output  (message  8S)  for  the  matching  documents. 

At  this  point  in  the  dialog  let  us  assume  that  the  user  already  knows  how  to  make 
an  output  request  or  that  he  refers  to  his  hard-copy  version  of  the  Guide.  In  any  case, 
the  user's  eighth  statement  requests  additional  selected  output  information.  Note  that 
by  stringing  the  "go"  command  along  with  the  "output"  command  the  user  signifies  he  is 
sufficiently  sure  of  his  statement  that  he  does  not  want  Intrex  to  respond  with  its  inter¬ 
pretation  and  timing  estimate  but  rather  to  "go"  on  and  perform  the  requested  output. 

The  system  then  responds  (message  9S)  as  directed.  Note  the  special  output 
information  giving  those  subject  terms  that  matched  (MATCHSUB)  and  the  estimated 
relevance  of  these  terms.  Parts  15.74  and  15.75  of  the  Guide,  which  are  included  in 
Fig.  B-4,  explain  how  relevance  is  estimated  for  these  terms. 

In  statement  9U  the  user  is  selecting,  by  means  of  an  INFIELD  command,  a  sub¬ 
set  of  the  7  documents  which  his  original  search  found.  This  command  enables  the  user 
to  request  only  those  documents  in  which  a  specific  character  string  (here,  "harvard") 
appears  in  a  particular  catalog  field  (here,  "affiliation").  Note  that,  at  the  same  time, 
the  user  is  changing  his  output  request  and  using  abbreviation  "o"  for  the  command 
name  "output"  and  71  for  the  field  name  "abstract". 

In  the  system's  response  (message  10S)  to  the  above  request,  the  user  has  availed 
himself  of  the  interrupt  capability  and  halted  the  output  at  the  point  indicated  by  the 
letters  "INT  1"  (the  additional  characters  on  the  next  line  remain  from  a  program  buffer 
and  would  not  appear  in  a  revised  version  of  the  interrupt  programs  now  being  debugged). 
The  system  then  responds  with  the  READY  message  indicating  the  user  may  go  ahead 
with  other  requests.  The  inters  4  capability  is  a  general  facility  allowing  the  user  to 
cut  short  any  system  messages.  After  the  user  has  become  familiar  with  the  system 
he  can  reduce  the  verbosity  of  system  messages  by  entering  the  SHORT  mode.  He  may 
do  this  at  any  time  during  the  dialog  or  even  upon  resuming  the  system  as  shown  in 
Fig.  B-5. 

MONITORING  SYSTEM  USE 

In  an  experimental  system  such  as  Intrex  it  is  important  to  observe,  unobtrusively 
if  possible,  the  man-machine  interactions,  the  problems  that  arise,  how  the  user  at¬ 
tempts  to  cope  with  them,  his  failures  and  successes,  and  so  forth.  Embedded  in  the 
Phase-II  system  is  an  elaborate  monitoring  system  which  records  all  console  activity, 
that  is,  everything  which  the  user  types  into  the  system,  and  everything  with  which  the 
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system  responds,  as  well  as  information  which  the  system  relates  about  its  internal 
state  as  it  progresses  through  a  console  session.  The  user  is  completely  unaware  of 
this  process  and  should  be  relieved  of  some  of  the  usual  problems  associated  with 
"over-the-shoulder "  observation  sessions. 

The  user  has  at  his  disposal  a  COMMENT  command  which  enables  him  to  make 
a  comment  to  the  system  about  the  system,  the  cataloging,  or  the  documents  in  the 
collection.  These  comments  are,  naturally,  recorded  by  the  monitoring  system.  It  is 
anticipated  that  through  this  monitoring  facility  we  shall  be  able  to  record  and  observe 
such  things  as  which  system  facilities  are  most  effective,  what  difficulties  are  en¬ 
countered  in  the  learning  and  operational  use  of  the  system,  which  fields  within  the 
augmented  catalog  are  most  often  referenced,  and  real-time  response  and  computer 
efficiency  (both  real  time  and  computer  time  used  at  each  interaction  are  recorded). 

See  Fig.  B-6  for  the  first  part  of  a  printout  of  the  monitor  file  created  in  the  sample 
session  illustrated  in  Fig.  B-3. 

EXPERIENCE  WITH  SYSTEM  USE 

The  Demonstration  System  was  operated  successfully  from  West  Berlin  via  tele¬ 
phone-communication  lines  to  the  Project  MAC  CTSS  System  in  Cambridge,  Massachu¬ 
setts  during  a  two-week  conference  on  The  Computer  and  The  University  held  at  the 
Technical  University  of  Berlin  (T.U.B.),  July  22  -  August  2,  1968.  Two  consoles  were  setup 
at  the  conference  to  demonstrate  the  system.  One  was  an  IBM  2741  teletypewriter 
console,  the  kind  employed  as  standard  user  I/O  equipment  for  CTSS.  The  other  was 
an  ARDS  graphical  display  and  keyboard  console  of  the  kind  developed  at  the  Electronic 
Systems  Laboratory  for  Project  MAC.  *  This  latter  console  requires  a  higher  capacity 
communication  link  to  drive  its  faster  rate  of  character  display.  As  it  turned  out,  the 
CTSS  time-sharing  system  and  the  Intrex  retrieval  system  performed  reliably,  but  a 
good  deal  of  trouble  was  experienced  maintaining  the  communication  links  during  the 
course  of  the  communications.  Closed -circuit  TV  monitoring  of  the  ARDS  display  in 
the  demonstration  room  at  T.U.B.  provided  enhanced -viewer  presentation  to  groups 
of  150  persons. 

Controlled  experiments  have  been  perforuved  at  M.I-T.  to  test  the  effectiveness  of  the  user 
prompting  and  instructional  aids  of  the  Demonstration  System  with  real  users.  Six 
users  have  tried  the  system  io  far.  These  users  are  staff  members  of  Project  Intrex 
who  have  not  been  directly  involved  with  the  development  of  the  retrieval  system. 

#  ' 

For  references  to  ARDS  see: 

(1)  R.  H.  Stotr  and  T.  B.  Cheek,  "A  Low-Cost  Graphic  Display  for  a  Computer 
Time-Sharing  Console,’'  M.I.T,  Electronic  Systems  Laboratory  Technical 
Memorandum  ESL-TM-J16,  July,  1967. 

(2)  R.  H.  Stotr,  'A  New  Display  T erminal Computer  Design,  April,  1968. 


Fig.  B-6  Portion  of  Monitor  File 


(MN0819  2773W\  (o8/19/68  1349. 9>^  Vlme  fI!.  wta  prIntW 

40144014  ^  Nome  of  Monitor  Files  include*  date,  console  no.  (277),  and  indication  of  time 


time  Is  17  past  13 
probno.  M5806 
progno.  160 

Greetings!  This  Is  Intrex  la.  Please  sign  In  by  typing  your  name  and 
address  as  In  the  following  example: 

smith,  r  J/mlt  13-5251 

Note  that  your  slgn-ln  statement  should  end  with  a  carriage  return. 

T IMF  ready  4. 0/4.0  13.6/13.6 
READY 

narcus,  r  s/ml t  55-406/ 

If  you  already  know  how  to  use  Intrex,  you  may  go  ahead  and  type 
In  commands.  (Rememher,  each  command  ends  In  a  carriage  return.) 

Otherwise,  for  Information  on  how  to  make  simple  queries  of  the 
catalog,  type 

Info  2 

or,  to  see  the  Table  of  Contents  (Part  1)  of  lntrex-1  Guide  which  will 
direct  you  to  other  parts  of  the  Guide  explaining  how  to  make  more  detailed 
auerles,  type 

Info  1 

TIME  ready  4. 0/8.0  38.4/52.0 

READY 

Info  2/ 

Part  2  of  Intrex  1A  Guide:  Simple  Queries 


To  find  documents  In  the  system  specify  your  Query  by  subject,  author, 
and/or  title  terms,  as  shown  In  the  3  examples  below: 

subject  ferroelectric  transitions 

author  Hess,  G. 8, /subject  helium 

title  sul fur  I zat lon/author  Swisher 

In  order  to  specify  additional  restrictions  (e.g.,  where  author  comes 
from.  Journal,  word  variations  not  to  use,  etc.),  see  Part  9  of  the  Guide 
(or  type  Info  9).  For  other  than  standard  output  (document  numbers,  titles, 
and  authors)  see  Part  I,  For  general  Intrex  command  format  and  abbreviations 
see  Parts  6,1  ar 4  6.2. 

To  see  Table  of  Contents  for  Intrex  la  Guide  and  how  to  use 
the  Guide  on  line,  type 


Info  1 

Otherwise,  you  may  now  make  simple  queries  or  use  any  other  command. 

TIME  ready  10.4/18.4  97.1/149.1 
READY 

subject  solid  phase  transitions/ 

A  search  on  your  query  SUBJECT  solid  phas-e  transit-ions  found  7  documents. 
To  output  the  catalog  fields  DOCUMFNT  NUMBER,  TITLE.  AUTHOR  on  those 
documents  type 


None  of  them  has  any  specific  knowledge  of,  or  interest  in,  the  subject  area  of  the 
Demonstration  data  base  (spectroscopy  of  solids  and  high-temperature  metallurgy). 
Indeed,  with  only  48  documents  this  data  base  is  not  practical  for  meaningful  retrieval 
experiments.  However,  these  users  were  more  or  less  unaware  of  the  specific  user- 
language  features  of  the  Demonstration  System  so  that  they  have  provided  a  good  means 
to  test  the  language  and  the  instructional  and  prompting  features  of  the  system. 

In  general,  the  users  found  the  Demonstration  System  easy  to  use,  though  they 
had  specific  suggestions  for  improvements.  Learning  to  use  the  system  was  signifi¬ 
cantly  aided  by  the  prompting  dialog  and  the  Guide,  but  it  was  evidently  difficult  to 
learn  some  aspects  of  the  system  without  any  human  help.  Users  varied  in  their  ap¬ 
proach  to  the  system.  Some  spent  considerable  time  (over  an  hour)  reading  the  hard¬ 
copy  version  of  the  Guide  before  sitting  down  at  the  console.  Others  spent  little  or  no 
time.  Similarly,  there  were  differences  in  online  behavior  on  such  points  as  extent  of 
use  of  hard-copy  and  online  versions  of  the  Guide,  interrupt  facilities,  „nd  LONG  and 
SHORT  message  modes.  The  experiences  so  far,  while  still  quite  limited,  have  sug¬ 
gested  some  particular  and  general  improvements  tothe  system,  as  discussed  below. 

SYSTEM  IMPROVEMENTS  UNDER  DEVELOPMENT 

A  number  of  system  improvements  have  been  planned  and  are  under  various 
stages  of  development.  A  partial  list  is  given  below. 

Programs  have  been  written  to  display  terms  in  the  inverted  file  and  to  show  how 
many  documents  are  indexed  under  each  term.  These  programs  have  been  debugged 
but  not  yet  integrated  into  the  system.  One  reason  for  this  is  that  computer -core  - 
storage  space  is  at  a  premium.  The  present  programs  consume  about  three -fourths  of 
the  32,000  locations  of  the  core  available  to  programs.  The  remaining  core  area  is 
needed  for  various  tables,  directories,  and  buffers.  Despite  careful  control  of 
dynamic -storage  allocation,  space  is  still  ''tight”.  Therefore,  we  are  planning  an  over¬ 
lay  system  which  will  enable  sections  of  the  system  to  be  read  in  frnrr>  disc  storage  as 
they  are  needed. 

Work  is  progressing  on  improving  the  efficiencv  of  certain  programs.  A  number 
of  programs  were  found  to  take  too  much  time  because  of  the  time  it  takes  to  open  and 
close  disc  files,  for  example,  a  dialog  of  the  kind  given  in  Tig.  B-3  requires  about 
two  minutes  of  computer  time. 

The  experience  with  users  described  above  has  lent  increased  emphasis  to  the 
need  for  improvements  in  the  dialog  which  we  had  previously  been  planning.  These  in¬ 
clude  more  complete  error  messages,  more  examples  in  the  Guide,  and  more  differ¬ 
entiated  dialog  for  different  situations  (for  example,  different  numbers  of  documents 
found). 

One  major  area  under  study  is  the  question  of  matching  algorithms  and  relevance. 
Aa  indicated  above,  matching  it  done  on  a  modified  "anding”  of  all  query  term*.  One 
would  presumably  like  the  uaer  to  have  the  ability  to  specify  any  combination  of 
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ands",  "ors  ",  and  'nots  "  among  query  terms  ;  terms  to  emphasize  within  the  search 
specifications,  online  modifications  ti  the  relevance  and  matching  criteria  (by  term 
ranges  or  weights,  for  example);  and  selective  override  to  the  stemming  algorithm. 
However,  all  these  variations  involve  complexities  in  programming  and  user  language, 
whereas  their  utility  is  undetermined,  especially  in  the  free -vocabulary  indexing  situa¬ 
tion  of  Intrex.  Therefore,  we  shall  try  to  get  some  retrieval  experience  with  the  pres¬ 
ent  system  before  making  extensive  additions  in  this  area. 

Other  improvements  being  planned  include:  decoding  of  catalog  fields  (for  ex¬ 
ample,  "English'1  for  "e  ")  on  output;  more  general  INFIELD  specifications  (for  example, 
ranges  on  dates);  more  accurate  estimates  of  time  to  perform  output;  and  naming  lists 
of  documents  or  arguments  for  later  referral  by  the  user. 

FREE-FORM  INPUT  LANGUAGE 

The  ease  of  adaptation  of  the  user  to  the  language  in  which  he  specifies  his  re¬ 
quests  is  recognized  as  a  vital  factor  in  user  acceptance  of  the  system.  Therefore,  a 
variety  of  approaches  to  this  problem  are  being  considered.  The  approach  that  has 
been  developed  most  fully  is  that  contained  in  the  Demonstration  System,  as  illustrated 
in  the  sample  dialog  of  Fig.  B-3.  This  approach  may  be  characterized  as  being  simple 
but  with  a  relatively  fixed-form  language.  Another  approach  characterized  by  a  free¬ 
form  input  language  is  under  study,  as  described  below. 

The  kind  of  interaction  this  free-form  input  language  is  intended  to  make  possible 
is  indicated  by  the  following  portion  of  a  sample  dialog: 

S(system)  :  If.  at  any  tune  you  wish  assistance,  type  a  question  mark  followed 
by  a  carriage  return.  PROGEED. 

U  (user)  liver  halides 

S  :  4  3  documents  found  under  SUBJECT  SILVER  HAL-IDES.  The  following  <  arc 
most  heavily  devoted  to  your  subject:  (prints  the  title,  author,  and  record  numbers  of 
7  clocuni'  nts ) 

U  :  •* 

S  :  At  this  point,  you  may: 

(1)  Obtain  more  information  about  these  dm  umrnts  . 

(d)  See  more  (  the  relevant  document*  (if  any  remain), 
ti)  Make  a  ncu  request  or  alter  the  given  request. 

J'o  obtain  information  about  any  of  these  alternatives,  type  the  appropriate 
number  (1,  .1  or  i)  followed  by  a  carriage  return.  If  these  alternatives  do  not  rnret 
your  needs  or  you  -a  ,*1:  more  help,  type  a  question  mark  followed  by  a  carnage  return. 

PROCEED. 

U  :  ’ 

S  .  Among  the  additional  items  of  information  that  you  may  obtain  is.  .  . 

Note  that  in  this  sample  dialog,  the  user  never  require  1  to  learn  that  he  should 
specif)  that  he  is  looking  up  a  sublet  t  term  ant!  that  to  do  so  he  should  use  the  word 


subject  or  the  letter  s_.  However,  at  the  point  that  the  dialog  is  broken  off  he  might 
begin  to  learn  such  details  about  the  system.  Even  so,  unlike  the  simple  online  use  of 
the  Guide  the  system  would  decide  for  the  user  which  information  he  is  to  obtain  and  he 
would  be  told  only  about  field  specification. 

Although  such  a  system  can  save  the  user  a  certain  amount  of  trouble,  it  also  in¬ 
curs  certain  costs  to  the  user.  When,  for  example,  the  user  fails  to  indicate  that  he 
is  presenting  ?.  subject  term,  the  system  might  begin  by  guessing  that  he  has  an  author 
in  mind.  If  there  is  i.o  author  with  the  appropriate  name  (for  example,  S.  Halides), 
the  only  cosi  is  the  cos'  of  the  additional  computer  time  required  to  determine  that  no 
such  author  name  exists  in  the  system.  If,  however,  by  coincidence,  an  appropriate 
author  name  does  exist,  there  is  an  additional  cost  in  the  confusion  which  may  arise. 
The  aim  of  the  guidance-system  design  is  to  attempt  to  minimize  such  difficulties  in  the 
hope  that  the  user  who  is  saved  the  cost  of  learning  about  the  system  will  find  the  bal¬ 
ance  in  his  favor  . 

The  guidance  system  keeps  track  of  where  a  user  is  through  a  moving  point  to  a 
set  of  interconnected  nodes,  and  controls  its  reactions  to  what  the  user  says  by  look¬ 
ing  at  the  contents  of  the  nodes  to  which  the  pointer  points.  Each  of  the  nodes  in  the 
interconnected  set  of  nodes  (or  graph)  is  a  serially  ordered  set  of  four-element  data 
structures  called  subnodes  .  The  first  such  subnode  in  a  given  node  controls  the  opera¬ 
tions  of  the  node  itself  while  the  remaining  subnodes  control  what  is  said  to  the  user, 
what  the  user  rnay  say  in  return,  what  is  done  when  he  says  something,  and  finally 
where  the  pointer  is  mov.  d  to,  if  an  appropriate  response  is  made. 

Thus,  for  example,  in  the  sample  dialog  segment  above,  the  first  message  printed 
to  the  user  is  controlled  by  a  node  which  also  interprets  his  response.  Here  ihr  user's 
statement  might  first  be  checked  (via  the  second  subnoue)  to  see  if  it  is  a  sign-in;  then 
if  this  test  failed,  it  wouid  be  checked  to  see  if  it  were  a  subject  term,  an  author  name, 
and  »o  forth.  The  reactions  are  controlled  by  subsequent  elements  in  the  initial  node. 
When  a  subject-term  match  is  found  in  the  inverted  Hies,  control  is  turned  over  to  the 
Intrex  system,  but  before  this  happens,  the  fourth  element  m  the  subnode,  according 
to  which  the  term  was  recognized,  moves  the  user’s  pointer  to  a  new  node  which  con¬ 
trol*  die  next  response.  It  is  this  node  that  is  used  to  respond  to  the  user’s  request 
tor  more  information. 

An  initial  version  of  this  facility  is  currently  being  programmed. 

data-base  generation 

Inputting 

The  inputting  of  catalog  information  stored  on  paper  tape  into  the  system,  its  on¬ 
line  editing,  and  associated  processing  has  continued  in  a  production  routine  as  de¬ 
scribed  in  previous  Activity  Reports.  An  experiment  to  evaluate  the  implication*  of 
online  inputting  of  catalog  information  via  an  IBM  2H)  directly  into  the  C  TSS 
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(Compatible  Time-Sharing  System)  was  performed  and  results  are  reported  in  Section 
B.  1.  From  a  computer  viewpoint  a  significant  observation  can  be  made,  namely,  that 
the  online -inputting  procedure  appears  to  be  rather  sensitive  to  the  host  environment 
(CTSS)  and  the  specific  programs  which  are  used  to  receive  the  input.  Additional  in¬ 
vestigation  is  planned  when  the  local  Intrex  console  and  buffer/rontroller  become  avail¬ 
able.  There  is  some  hope  that  this  satellite-computer  system  may  make  online  input¬ 
ting  more  attractive. 

Programming  Changes 

As  experience  was  gained  in  the  operation  of  the  programs  for  Phase-II  inverted- 
file  and  catalog-record  generation,  several  opportunities  for  achieving  greater  pro¬ 
gramming  efficiency  were  noted.  One  kind  of  improvement  is  the  more  fully  automated 
chaining  of  the  programs.  Another  kind  of  improvement  involves  the  omission  of  most 
line-ending  characters  and  tabs  as  originally  typed  from  the  catalog  records  in  order 
to  obtain  easier  formatting  of  output  on  consoles  having  different  maximum  line  lengths 
(IBM  2741,  about  120  characters;  ARDS,  about  80;  Intrex  display  console,  about  60). 
The  revised  formatting  program  is  described  below. 

As  soon  as  these  revised  programs  are  fully  debugged  we  intend  to  pursue  the 
generation  of  the  inverted  file  and  catalog  record  data  base.  V'e  hope  to  expand  quickly 
the  number  of  documents  in  storage  from  the  present  48  to  at  least  about  1000  so  that 
we  may  uegin  experiments  with  real  users  with  a  data  base  allowing  meaningful  re¬ 
trieval. 


F v^rmatting  Program 

The  formatting  program  takes  the  edited  catalog  records  from  the  disc  and  re¬ 
formats  them  for  efficient  storage  and  retrieval.  The  purposes  of  the  program  are: 
to  reduce  the  amount  of  space  required  for  storage;  to  store  data  in  a  form  that  min¬ 
imires  retrieval  time;  and  to  detect  remaining  errors  wherever  possible. 

The  formatting  program  produces  three  major  types  of  output:  an  updated  catalog- 
record  file  produced  by  adding  the  new  catalog  records  in  its  input  to  the  existing  file; 
shred  files  which  provide  the  raw  materials  used  for  updating  the  inverted  files,  and  the 
catalog  directory  file  through  which  the  retrieval  system  accesses  the  catalog  records. 

Catalog-Record  File.  The  catalog-record  file  contains  the  information  about 
documents  that  have  been  entered  by  the  catalogers.  Information  in  this  file  is  struc¬ 
tured  so  that  the  formatting  information  (which  indicates  where  records,  and  fields 
within  records,  begin  and  end)  is  separated  from  content  information.  The  formatting 
information  is  contained  in  the  header  (Fig.  B - 7 )  which  indicates  where  a  given  record 
or  field  begins  and  ends.  The  rest  of  the  information  is  stored  in  the  upper  and  lower 
oodies.  The  upper  body  contains  the  information  that  does  not  require  a  free  format. 
Such  information  can  be  both  compressed  to  save  space  and  preformatted  to  simplify 
retrieval.  Among  the  fields  included  in  this  portion  of  the  catalog  record  are  the 


record  number,  the  level  of  treatment,  language,  and  the  like  (see  Fig.  B-7).  For 
each  of  the  fields  included  in  the  upper  body  there  is  an  escape  option  for  use  when  the 
information  in  these  fields  does  not  fit  within  the  pre-established  format  and  must  be 
stored  in  the  lower  body. 

The  bulk  of  the  information  in  a  catalog  record  is  stored  as  straight  text  in  the 
lower  body,  Characters  that  serve  to  delimit  fields  and  record  entries  are  removed, 
as  are  formatting  characters  (carriage  returns  and  tabs)  whose  position  is  largely  a 
function  of  the  line  width  of  the  output  device.  The  average  formatted  catalog  record 
requires  approximately  500  computer  words  for  storage. 

Shred  F iles ,  The  shred  files  firm  the  basis  for  updating  the  inverted  files. 
Shred  files  are  produced  from  the  information  in  the  author,  title,  and  subject  files. 
Separate  shreds  are  produced  for  each  author  last  name,  and  for  each  word  or  item  in 
the  subject  and  title  fields.  These  files  provide  the  input  for  the  programs  that  gen¬ 
erate  the  inverted  files  by  stemming  the  title  and  subject  words  in  the  shreds,  sorting 
the  result,  and  amalgamating  it  with  the  existing  inverted  files. 

The  Catalog-Directory  File.  The  retrieval  programs  access  individual  records 
through  their  record  numbers.  The  purpose  of  the  catalog-directory  file  ;s  to  tie  this 
record  nuinoer  to  the  physical  location  of  that  record  within  the  computer.  The  first 
ten  words  of  the  catalog-directory  file  (Fig.  B-8)  serve  to  store  general  information 
about  the  catalog  records  and  the  directory  file  itself.  The  remainder  serve  to  store 
pointers  to  the  catalog  records  with  the  n-th  word  in  this  portion  pointing  to  record 
number  n.  The  catalog -directory  file  is  separated  from  the  catalog  records,  and  all 
accessing  of  such  records  is  done  via  this  file,  in  order  to  make  it  possible  to  change 
the  physical  location  of  a  record  for  purposes  of  correcting,  appending,  or  otherwise 
updating  records  while  making  a  change  in  only  one  place  in  the  system. 

Program  Operation.  The  formatting  program  requires  approximately  two  to 
three  seconds  to  process  a  single  catalog  record.  Because  of  the  relatively  free  for¬ 
mat  of  the  data  within  most  fields,  only  a  limited  amount  of  error  detection  is  possible 
and  no  error  correction  is  attempted.  Currently, errors  are  detected  in  less  than  one 
out  of  25  records . 

Stemming 

The  work  of  Miss  Julie  Lovins  in  developing  stemming  algorithms  was  mentioned 
in  the  Semiannual  Activity  Report  of  September  15,  1967.  ihis  work,  which  has  been 
described  in  a  Technical  Memorandum  (see  Section  V),  includes  procedures  for  stem¬ 
ming  words  in  a  two-phase  process.  In  the  first  phase,  the  longest  possible  ending 
from  a  list  of  endings  is  dropped  from  the  word.  Before  an  ending  is  dropped  it  must 
satisfy  a  context  rule.  (For  example,  do  not  drop  s_  after  s_.)  The  Phase -D  Intr ex 
retrieval  system  includes  transformational  rules  to  account  for  certain  spelling  anom¬ 
alies  in  English  (for  example,  absorb/absorjj-tion;  split/split-t-ing). 
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Because  the  transformational  rules  of  Phase  II  involved  various  complexities  in 
actually  performing  the  stemming  and  keeping  account  of  it  in  the  inverted  files,  and 
because  the  number  of  additional  cases  it  handled  seemed  relatively  small  it  was  de¬ 
cided  to  try  out  only  the  Phase-I  procedures  in  initial  Intrex  systems.  A  list  of 
endings  and  context  rules  for  applying  them  are  contained  in  the  Lovins  memorandum. 

Several  thousand  subject-term  words  from  several  dozen  catalog  records  have 
been  stemmed  according  to  these  Phase-I  procedures.  So  far  the  results  seem  very 
good  and  only  a  few  modifications  to  the  original  procedures  appear  desirable.  A 
few  endings  have  been  added  and  the  minimum  stem  length  was  raised  from  two  to 
three  letters.  Figure  B-9  gives  a  statistical  compilation  of  the  number  of  endings 
found  for  each  ending  type  in  one  run  of  2,382  words  stemmed. 

MESSAGE -FILE  MAINTENANCE 

We  are  well  aware  that  feedback  is  essential  for  effective  interaction,  but  we 
also  recognize  that  the  experienced  user  may  lose  patience  with  the  verbose  mes¬ 
sages  which  are  appropriate  for  the  less  experienced  user.  With  an  eye  to  this 
problem,  our  output  routines  have  been  designed  and  implemented  to  print  sym¬ 
bolically  referenced,  externally  stored  (on  the  disc)  messages  interposed  with  in- 
core  parameters.  By  keeping  references  to  messages  symbolic  (for  example,  as 
BCD  message  names),  we  can  easily  control  what  a  particular  message  says.  This 
level  of  control  is  useful  to  modify  message  content  as  the  dial  evolves  as  well  as 
controlling  the  verbosity  of  the  dialog.  The  current  system  permits  only  a  binary 
choice,  LONG  or  SHORT.  Whenever  a  command  is  given  to  change  the  mode,  the 
name  of  the  message  file  (the  message  files  are  stored  on  the  disc)  is  changed. 

This  in  turn  causes  the  messages  to  emerge  in  their  full  or  abbreviated  form.  In 
the  future,  it  may  be  advantageous  to  provide  one  or  more  intermediate  levels  of 
verbosity  between  the  extremes  of  LONG  and  SHORT. 

The  Guide  and  other  standard  dialog  messages  are  kept  in  disc  files  except 
for  a  few  common  short  messages  which  are  kept  in  core.  Messages  may  be  readily 
changed  by  standard  CTSS  editing  programs.  The  maintenance  of  the  Guide,  de¬ 
scribed  below,  is  illustrative  of  the  required  procedures. 

Two  versions  of  the  Guide  are  required,  one  for  the  hard-copy  edition,  the 
other  for  online  use.  The  hard-copy  Guide  must  be  generated  by  a  file  containing 
instructions  for  the  manuscript  format  generating  program,  ROFF.  The  online 
version  must  contain  message  labels  and  delimiters  for  response  to  online  requests 
about  INTREX  operation. 

To  facilitate  updating  of  the  Guide,  an  ASCII  master  file  is  kept.  This  file 
contains  both  the  online  version  labels  and  delimiters  and  the  ROFF  control  instruc¬ 
tions.  Corrections  and  revisions  are  made  on  this  master  copy  through  use  of  the 
CTSS-editor  program  designated  QED.  If  an  online  Guide  version  is  desired,  one 
first  issues  approximately  five  QED  requests  to  delete  the  ROFF  instructions,  and 
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then  one  processes  this  Guide  through  the  message  directory  generator,  DIRGEN. 
For  a  hard-copy  version,  one  uses  QED  to  edit  the  labels  and  delimiters  of  the 
master  copy  of  the  Guide  and  processes  the  resulting  file  through  ROFF.  The  ROFF 
output  file  may  then  be  printed  at  the  console  or  saved,  written  on  magnetic  tape, 
and  then  processed  through  the  Project  MAC  1401  computer  with  high-speed  printer. 

The  present  hard-copy  edition  (24  July  1968)  was  produced  on  ditto  masters  at 
the  console;  the  ditto  masters  were  used  to  make  copies  of  the  40-page  guide. 

AUTOMATIC  INDEXING 

New  methods  for  generating  subject  indices  automatically  have  been  studied  by 
William  R.  Kampe  as  part  of  his  work  toward  a  Master's  degree  in  Electrical  Engi¬ 
neering.  The  results  of  this  work  are  reported  in  Mr.  Kampe' s  Master's  thesis 
which  has  now  been  issued  in  slightly  revised  form,  as  a  Technical  Memorandum. 
(See  Section  V.  ). 

The  main  motive  behind  this  research  was  to  develop  techniques  for  getting 
documents  into  the  augmented  catalog  more  rapidly.  Since  it  now  appears  that  more 
than  half  the  time  required  to  catalog  a  document  is  devoted  to  the  subject  indexing, 
the  use  of  techniques  that  could,  at  least  temporarily,  permit  the  omission  of  this 
step,  could  result  in  a  considerable  decrease  in  the  amount  of  time  required  to 
enter  a  document  into  the  data  base. 

All  the  techniques  studied  derive  the  subject  words  from  the  title  and  the  ab¬ 
stract  of  e  document.  This  information  is  included  among  the  descriptive- 
cataloging  information  that  has  to  be  entered  anyway,  so  that  no  additional  inputting 
is  required.  An  empirical  comparison  of  the  words  appearing  in  the  subject- 
index  field  with  the  words  appearing  in  these  two  fields  showed  that  over  ninety 
percent  of  the  words  appearing  in  the  subject  field  also  appeared  in  the  title  and/or 
abstract  fields  so  that  the  automatic  generation  of  subject  words  from  these  fields 
seemed  possible,  at  least  in  principle.  Furthermore,  Salton*  ha*  found  that  the 
indices  generated  from  the  title  and  abstracts  of  journal  articles  tended  to  be  ap¬ 
proximately  as  good  as  indices  generated  from  full  text. 

Three  methods  for  pre-indexing  documents  were  compared.  The  major  dis¬ 
tinctive  features  of  these  techniques  is  their  reliance  on  previous  human  indexing 
for  deciding  which  words  are  relevant  and  the  use  of  this  information  to  discard, 
rather  than  retain,  words  for  indexing.  Comparison  of  the  efficacy  of  these  tech¬ 
niques  was  made  by  matching  their  results  with  those  obtained,  independently,  by 
human  indexers. 


* 

Salton,  G.  (ed),  "Information  Storage  and  Retrieval,"  ISR-11,  Department  of  Com¬ 
puter  Sciences ,  Cornell  University,  Ithaca,  N.  Y.  ,  June,  1966. 


All  three  techniques  are  based  on  the  "usage  rate"  of  terms  in  previously  in¬ 
dexed  documents.  This  usage  rate  is  a  single  parameter  that  indicates  how  often 
the  given  word  has  been  used  in  the  subject  index  when  it  appeared  in  the  title  or 
abstract  of  the  document.  In  pre -indexing  Method  I,  words  are  retained  if  their 
usage  rate  exceeds  a  certain  threshold.  (The  efficacy  of  the  method  depends  on 
the  threshold,  and  several  levels  were  tested.  )  In  Method  II  words  with  high  usage 
rates  are  also  discarded  on  the  grounds  that  they  are  either  function  words  ('the", 
"of",  "in"  and  the  like)  or  very  common  words  in  the  collection  (such  as  "magnetic") 
neither  of  which  convey  much  information  about  the  document.  Again,  of  course, 
this  method  depends  on  the  "commonness"  threshold,  and  again,  several  levels  of 
the  threshold  were  tested.  Method  III  is  like  Method  II  except  that  an  ambiguous 
area  is  neither  very  high  nor  very  low.  Decisions  in  this  ambiguous  middle  area 
are  made  on  the  basis  of  the  "context".  If  either  of  the  neighboring  words  is  in  the 
area  of  unambiguous  inclusion  then  the  given  word  is  included,  otherwise  it  is  not. 

Evaluation  of  these  methods  was  made  on  the  basis  of  two  criteria:  complete¬ 
ness  ,  or  the  percentage  of  appropriate  words  that  could  have  been  included  and 
were,  and  relevance  or  the  percentage  of  inappropriate  words  that  were  excluded. 
Appropriateness,  in  both  cases,  was  determined  by  seeing  if  human  indexers  had 
selected  the  words. 

When  this  criterion  was  taken  without  interpretation,  Method  I  was  the  most 
successful,  achieving  over  90-percent  completeness  with  over  60  oercent  relevance 
at  its  optimal  thresholds.  However,  much  of  this  success  was  seen  to  be  based  on 
the  inclusion  of  function  words  and  other  words  with  little  information  content. 

When  these  were  excluded,  Methods  II  and  III  were  seen  to  be  better  with  Method 
ill  attaining  both  relevance  and  completeness  ratings  of  over  90  percent  at  its  opti¬ 
mal  setting.  These  results  suggest  that  rapid  pre-indexing  along  the  lines  investi¬ 
gated  is  feasible. 

AUGMENTED  CATALOG  STORAGE  DEVICE 

We  are  completing  specifications  for  an  experimental  test  setup  for  the  pur¬ 
pose  of  demonstrating  feasibility  and  predicting  performance  of  the  magnetic-tape 
device  described  in  the  preceding  Activity  Report.  The  most  radical  departure  in 
the  approach  under  investigation  fromcurrent  tape  transports  occurs  in  the  utili¬ 
zation  of  the  magnetic -transfer  process  from  the  tape  to  the  cylinder.  Accordingly, 
the  experiment  will  concentrate  on  maximizing  bandwidth  of  the  transfer  and  read 
processes.  This  involves  not  only  linear  information  density,  but  also  the  tape  ve¬ 
locity. 

In  order  to  facilitate  experiments,  the  test  facility  will  differ  from  the  pro¬ 
posed  device  previously  desc ribed.  Tape  reels  are  eliminated  and  a  continuous  loop 
of  tape  is  utilized.  We  desire  to  be  able  to  both  read  and  write  on  the  same  device. 
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This  is  accomplished  by  providing  a  write  head  which  operates  in  contact  with  the 
tape  at  writing  velocity  (less  than  100  inches  per  second)  and  over  which  the  tape 
floats  at  reading  velocity  (1,000  inches  per  second). 

Our  experiment  imposes  two  constraints  on  the  magnetic  cylinder  assembly. 
First,  we  desire  to  try  several  different  thicknesses  and  types  of  coatings,  and  thus 
would  like  to  be  able  to  replace  the  cylinder  easily.  Secondly,  the  runout  of  the 
cylinder  must  be  carefully  controlled  so  as  to  permit  in-contact  reading  at  high  ve¬ 
locity.  These  constraints  will  be  met  by  use  of  a  precision  grinding  spindle.  Each 
cylinder  blank  will  be  mounted  in  its  own  wheel  holder,  ground  true  on  the  same 
spindle,  and  subsequently  plated.  It  can  then  be  easily  and  accurately  mounted  on 
the  spindle  when  needed. 

Such  an  experimental  setup  will  demonstrate  the  writing,  transfer,  and  read¬ 
ing  of  digital  data  under  conditions  similar  to  those  of  the  proposed  device.  It  will 
allow  us  to  determine  how  variations  in  coating  thicknesses,  materials,  and  writing 
and  reading  techniques  affect  the  bandwidth  at  which  data  can  be  read  reliably. 

3.  THE  DISPLAY-CONSOLE  SYSTEM 

Staff  Members 

Dr.  D.  R.  Haring 

Mr.  J.  E.  Kehr 

Professor  J.  K.  Roberge 

SUMMARY 

In  preceding  Semiannual  Activity  Reports  a  complete  system  design  for  the 
augmented-catalog  display-console  system  and  the  functional  block  diagram  of  the 
display  console  itself  have  been  described.  During  the  present  reporting  period 
the  construction  of  the  electronics  and  logic  for  the  console  has  been  completed  and 
they  are  presently  undergoing  tests.  The  logic  design  of  the  buffer/controller  has 
also  been  completed  and  is  presently  being  construct*  d.  Minor  changes  have  been 
made  in  the  system  configuration  described  in  the  preceding  reports  in  order  to  im¬ 
prove  system  performance  and  to  take  advantage  of  special  features  of  the  detailed 
properties  of  the  logic  building  blocks  being  used. 

The  character  generator  described  in  the  preceding  Semiannual  Activity  Re¬ 
port  is  now  operating  at  the  proper  speed  and  produces  high-quality  characters.  It 
is  presently  being  integrated  into  the  logic  of  the  console. 

Work  continues  on  the  6201  buffer/controller  software,  some  of  which  is  de¬ 
scribed  here.  Current  experimental  investigations  being  conducted  on  the  display- 
console  system  are  also  presented. 


SYSTEMS  CONSIDERATIONS 

The  buffer/controller  (B/C)  of  the  display-console  system  serves  as  the  inter¬ 
face  among  several  augmented-catalog  consoles,  the  text-access  central  station  and 
the  time-shared  computer.  The  major  elements  contained  in  the  B/C  are  a  small 
processor  to  manage  the  data  flow  and  a  serial  memory  to  refresh  the  CRT  displays 
of  the  consoles.  The  serial  memory  also  provides  back-up  memory  for  the  small 
processor.  Various  features  of  the  B/C  have  been  discussed  in  previous  activity 
reports.  Details  of  its  design  and  operation  not  previously  reported  are  presented 
in  the  paragraphs  that  follow.  A  functional  block  diagram  is  shown  in  Fig.  B-10. 
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Project  Intrex  is  designing  and  constructing  the  logic  that  is  contained  in  the 
B/C.  The  purpose  of  this  logic  is  to  interconnect  the  serial  memory  and  small 
processor  with  the  various  external  devices.  The  logic  design  of  the  B/C  has  now 
been  completed  and  the  logic  is  presently  being  constructed. 
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The  Varian  Data  Machines  6201  manages  the  data  flow  in  thd  B/C  through  con¬ 
nections  to  its  E  Bus  and  input/output  (I/O)  Control  Bus.  When  the  B/C  is  first 
turned  on  and  after  the  Vermont  Research  1004S  drum  has  attained  operating  speed 
(approximately  four  minutes)  the  SYSTEMS  RESET  button  is  actuated  on  the  6201 
control  panel.  This  places  the  B/C  at  a  known  initial  state.  The  6201  operating 
programs  are  then  loaded  and  the  system  is  ready  for  operation. 

To  provide  a  foundation  for  our  discussion  about  the  operation  of  the  B/C, 
let  us  first  briefly  describe  the  operational  properties  of  the  6201  computer.  The 
standard  6201  communicates  with  peripheral  equipment  on  the  16-bit  word  parallel 
(I/O)  E  Bus  and  12-line  I/O  Control  Bus.  Each  information  transfer  occurs  under 
control  of  a  stored  program.  Information  exchanges  with  peripheral  devices  are 
synchronized  by  their  controllers.  Each  controller  and  the  device  that  it  controls 
comprise  a  peripheral  device.  The  basic  machine  provides  four  types  of  I/O 


operations  : 


External  Control:  An  external  control  code  is  transmitted  under  pro¬ 
gram  control  from  the  computer  to  a  peripheral  con¬ 
troller. 

Program  Sense:  The  status  of  a  selected  peripheral  controller  is 

interrogated  by  the  computer  under  program  control. 

Single-Word  In-  A  single  16-bit  word  of  data  is  transferred  under 

put  Transfer:  ,  .  ,  ,  ...... 

r  program  control  from  a  peripheral  controller  to  the 

6201  A  register,  B  register,  or  any  memory 
location. 

Single-Word  Out-  A  single  ih-h-.t  word  of  data  is  transferred  under 

put  Transfer:  ...  .  .  ,  .  , 

program  control  to  any  peripheral  controller  from 

the  6201  A  register.  B  register,  or  any  memory 

location. 

In  addition,  the  6201  used  in  the  R 'C  has  a  Di rect -Memory -Access -and - 
Interrupt  (DMA/l)  option.  With  the  DMA.  I  option  the  computer  is  capable  of  I/O 
data  transfers  to  or  from  memory  without  program  intervention.  This  option  allows 
external  devices  on  the  I/O  bus  to  request  two  type*  of  functions,  first,  the  inter¬ 
rupt  feature  of  the  DMA/l  option  permits  an  external  device  to  request  the  INTER¬ 
RUPT  FUNCTION.  When  such  a  request  is  made,  an  instruction,  which  is  inde¬ 
pendent  of  the  main  program,  can  be  executed.  When  two  or  more  requests  are 
received  simultaneously,  a  hard-wired  external-device -priority  scheme  determine* 
which  device  has  priority.  The  computer  is  directed  to  a  memory  location  speci¬ 
fied  by  the  interrupting  device  and  i*  caused  to  execute  the  instruction  found  at  that 
location.  Any  6201  instruction,  other  than  an  I  O  command,  may  be  executed  in 
this  manner.  Normally,  however,  the  instruction  will  be  a  iump-and -mark 
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instruction  and  will  result  in  the  processing  of  an  I/O  subroutine.  Secondly,  the 
interrupt  feature  of  the  DMA/l  option  also  permits  an  external  device  to  request 
the  TRAP  FUNCTION.  When  such  a  request  is  made,  the  trapping-device  o  ntrol- 
ler  can  either  request  that  the  (201  output  one  word  of  data  from  memory  (trap  out) 
or  request  that  the  6201  input  one  word  of  data  to  memory  (t  in).  The  address 
of  this  word  is  placed  on  the  E  Bus  by  the  controller  after  the  6201  acknowledges 
the  trap  request.  The  stored  program  is  caused  to  remain  idle  for  2.7  micro¬ 
seconds  during  which  time  the  transfer  takes  place.  This  ''cycle-steal’ng"  action 
does  not  disturb  the  operational  registers  of  the  6201,  thus  allowing  the  stored  pro¬ 
gram  to  proceed  normally  at  the  conclusion  of  the  data  transfer.  The  trap  function 
is  also  connected  to  the  priority  logic. 

When  the  B/C  is  completed,  it  will  also  have  a  device  that  is  equivalent  to 
the  Buffer-Interlace-Controller  (BIC)  option  w  lich  is  available  for  the  6201,  The 
BIC  generates  the  addressing  and  timing  signals  that  control  data-word  transfer 
between  the  selected  peripheral  device  and  the  computer  through  the  trap  function. 
This  option  controls  the  transfer  of  data  blocks  between  the  memory  of  the  6201 
and  peripheral  I/O  devices  concurrently  with  program  operation  in  the  6201.  Full 
16-bit  word  transfers  can  be  controlled  at  rates  up  to  202,  000  words  per  second 
(3.232  megabits  per  second). 

Further  details  about  the  operation  of  the  6201  and  its  options  can  be  found  in 
the  appropriate  Varian  Data  Machines  manuals. 

Observe  in  Fig.  B-10  that  six  peripheral  devices  are  connected  to  the  E  Bus 
and  I/O  Control  Bus  of  the  6201.  Each  of  these  devices  has  at  least  one  address  by 
which  it  can  be  idrntifiec  by  the  computer.  Only  the  Drum  Controller  and  Real- 
Time  Clock,  the  Data-Phone  Interface  A,  and  the  Buffer  Interlace  Controller  (BIC) 
have  interrupt  capabilities.  The  Drum  Controller  and  Real-Time  Clock  has  the 
highest  priority,  followed  by  the  BIC  Buffer  Interlace  Controller  which,  in  turn,  is 
followed  by  the  Data-Phone  Interface  A.  The  remaining  peripherals  operate 
strictly  under  program  control  by  the  computer. 

When  commanded  to  do  so  by  means  of  the  B  Bus,  the  BIC  can  control  data 
transfers  between  the  6201  and  either  the  Data-Phone  Interface  A  or  the  Drum  Con¬ 
troller  and  Real-Time  Clock.  To  increase  system  flexibility  the  Drum  Controller 
t  an  be  commanded  to  perform  much  like  the  BIC  to  transfer  fixed-length  blocks  of 
data.  The  block  length  is  equivalent  to  one  line  of  data  that  can  be  displayed  on  the 
c  ons  o  le  CRT  (Sfc  characters). 

External -Control  commands  allow  the  e20I  to  control  the  various  modes  of 
operation  of  the  Drum  Controller,  and  Program-Sense  command*  allow  the  6201  to 
determine  the  status  of  the  data  registers  in  the  controller.  Finally,  provisions 
have  been  made  to  connect  the  controller  to  the  BIC  in  order  to  do  arbitrary-s  ised 
block  transfers  of  data. 
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The  Real-Time  Clock  is  used  to  time  events  in  the  console  system.  It  is 
simply  a  16-bit  binary  counter  that  is  advanced  by  one  for  each  drum  revolution. 

Since  the  drum  turns  approximately  57.5  revolutions  per  second,  a  new  epoch  occurs 
approximately  every  20  minutes.  The  computer  can  request  the  reading  of  this  clock 
at  any  time  by  a  DATA  IN  command. 

In  the  initial  system,  communications  betwet  the  time-shared  computer  and 
the  B/C  is  by  means  of  a  data  phone,  which  is  connected  to  the  6201  by  means  of 
Data-P'none  Interface  A.  This  interface  is  designed  to  operate  through  a  standard 
1200-bps  full-duplex  data  phone  and,  with  minor  modifications,  to  operate  up  at  the 
full  bandwidth  of  the  6201.  In  addition  to  the  sti  \rd  DATA  IN'  and  DATA  OUT  com¬ 
mands,  data  transfers  to  and  from  the  time-shared  c  mputer  can  operate  on  the 
interrupt  function  of  the  6201. 

By  External -Control  commands  it  is  possible  to  mask  out  input  and  output 
interrupts  selectively.  Program-Sense  commands  allow  the  6201  to  determine  the 
status  of  the  data  registers  in  the  interface.  Finally,  provisions  have  been  made  to 
connect  the  interface  to  the  BIC  to  block-transfer  data. 

Communications  h  tween  the  B/C  and  the  text-access  central  station  is  oy 
means  of  a  data  phone,  which  is  connected  to  the  6201  by  means  of  Data-Phone  Inter¬ 
face  B.  This  interface  is  designed  to  operate  through  a  standard  150-to  300-bps 
full-duplex  data  phone.  Only  the  standard  DATA  IN  and  DATA  OUT  commands  can 
be  used  to  transfer  data  between  the  6201  and  the  text-access  central  station. 
Furthermore,  data  are  permitted  to  flow  in  only  one  direction  at  a  time. 

Program-Sense  commands  allow  the  6201  to  determine  the  status  of  the  data 
register  in  the  interface. 

The  augmented-catalog  consoles  are  connected  to  the  6201  l/O  buses  by  means 
of  the  Console  Buffer  and  the  Console  I/O  Controller.  The  former  buffers  the  data 
that  are  transferred  from  any  conso!-  to  the  S/C  by  means  of  two  16-bit  registers. 
Program-Sense  commands  allow  the  computer  to  determine  the  status  of  these  data 
registers,  and  DATA  OUT  commands  transfer  the  data  -  register  contents  to  the 
6201, 

A  register  that  is  composed  of  a  single  flip  flop  in  each  Console  Logic  inter¬ 
face,  but  addressed  through  the  Console  I/O  controller,  stores  the  'request-for- 
scrvice"  signals  from  the  tons  ies.  Upon  execution  of  the  proper  DATA  IN  command, 
this  register  is  t  r  io.*fe  rred  to  the  6201. 

A  second  register  that  is  made  up  of  a  tingle  tlip  flop  sn  each  Console  I.ogu 
Interface,  but  addressed  through  the  Console  I/O  controller,  is  set  by  the  proper 
DATA  OUT  command  o  initiate  the  transfer  of  data  from  a  selected  console  to  the 
C onsole  fluff  e  r . 

Figure  B-10  shows  that  data  are  sent  directly  from  the  drum  to  the  augmented  - 
catalog  consoles  without  going  through  the  F.  flu*  of  the  6201.  The  role  of  the  6201  ir. 
this  case  is  simply  to  select  which  drum  track  is  being  viewed  by  c-Hat  console. 


A  change  has  been  made  in  the  method  of  sending  the  INTERROGATE  signal 
from  the  B/'  to  an  augmented-catalcg  console.  Previously,  it  was  reported  that 
this  signal  would  be  a  burst  of  sine  waves.  Because  it  is  simpler  from  the  sys- 
t'  .  t  s  viewpoint,  the  INTERROGATE  signal  is  sent  by  making  one  of  the  data  pulses 
from  the  B/C  to  the  given  console  longer  in  time  duration  than  all  other  data  pulses. 

Detailed  functional  block  diagrams  of  the  individual  elements  contained  in 
the  B/C  will  be  discucsed  in  future  ac  ivity  reports. 

CHARACTER -GENERATOR  AND  DISPI  AY-TUBE  CIRCUITRY 

The  operation  of  the  flying-spot  scanner  character  generator  has  been  de¬ 
scribed  in  -arUer  reports.  The  initial  display  system,  consisting  of  this  charac¬ 
ter  generator  and  an  entertainment -quality  cathode- ray  tube  has  been  completed 
and  this  system  is  operaiiona  .  Figure  B-ll  illustrates  a  display  produced  by  the 


Fig.  B-ll  A  Display  of  the  Output  of  the  Character  Generator 


system.  The  full  56  by  31  character  display  is  ind:  ated  in  the  figure.  The  display 
is  refreshed  at  a  nominal  60-Hz  rate. 

The  current  equipment  demonstrates  the  feasibility  of  the  proposed  character 
generation  and  display  system.  Only  minor  changes  to  further  improve  character 
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quality  are  required  to  convert  this  prototype  system  to  final  form.  The  cathode- 
ray  tube  will  be  changed  to  14-in.  diagonal  measure  from  the  lb-in.  unit  now  used. 
The  new  tube  will  have  a  90°  deflection  angle  rather  than  the  110°  angle  of  the 
present  tube,  and  v/ill  incorporate  improved  electron  optics.  These  changes  are 
required  because  of  the  beam  defocusing  which  occurs  at  large  deflection  angles  with 
the  present  tube.  Several  tubes  are  now  being  evaluated.  A  higher  quality  yoke, 
available  from  Celco  Electronics,  will  be  used  tc  deflect  the  beam  of  the  new  tuoe. 
These  modifications  should  reduce  the  spot  size  by  approximately  a  factor  oi  two 
compared  with  the  present  system,  and  produce  characters  over  the  entire  screen 
with  better  quality  than  those  now  obtained  in  the  center  of  the  disp’ay. 

The  beam-positioning  and  other  required  circuitry  for  a  monoscope  charac¬ 
ter  generator  have  been  designed  and  will  be  tested  in  the  near  future.  It  will  be 
possible  to  use  the  monoscope  generator  as  a  direct  replacement  for  the  flying- 
spot-scanner  character  generator  in  situations  where  the  larger  capacity  (256  char¬ 
acters  for  our  display  compared  with  9e  for  the  monoscope)  and  ease  of  alteration 
of  the  character  set  are  not  required.  The  monoscope  generator  is  physically 
smaller  and  less  expensive  than  our  flying-spot-scanner  unit. 

A  modified  display  system  which  may  result  in  the  design  of  a  single  console 
suited  to  both  catalog  and  text-access  use  is  also  under  investigation.  As  explained 
in  the  next  section,  one  of  the  text-access  stations  will  use  a  Tektronix  Type  611 
Storage  Display  Unit  to  provide  a  transient  display  of  full  text.  This  display  unit 
has  sufficient  resolution  for  catalog  use,  but  has  several  disadvantages  when  oper¬ 
ated  in  the  storage  mode.  The  inflexibility  of  the  stored  display  compared  with  a 
refreshed  display  prevents  the  use  of  a  light  pen  and  increases  the  demands  on  the 
centra'  processor  when  editing  is  required.  Furthermore,  the  lifetime  of  the  tube 
in  the  storage  mode  is  relatively  restricted,  leading  to  high  operating  costs. 

It  is  possible  to  use  the  storage  tube  in  a  non-store  mode,  and  in  this  mode 
the  disadvantages  indicated  above  are  not  present.  It  therefore  seems  that  the  use 
of  the  single  display  tube  in  a  storage  mode  for  text  access  and  in  a  non-store,  re¬ 
freshed  mode  for  catalog  use  may  esult  in  a  single,  economical  console  which 
combines  these  two  major  functions. 

The  deflection  bandwidth  of  the  Type  611  Storage  Display  Unit  is  insufficient 
for  a  2000-c,,aracter  display  refreshed  at  a  60-Hz  rate.  However,  the  techniques 
which  have  already  been  developed  for  the  existing  character-generator  and  display- 
tube  system  can  be  used  with  the  Tektronix  display.  A  Type  611  Display  Unit  has 
been  ordered  for  evaluation  in  this  application.  The  unit  will  be  modified  by  in¬ 
cluding  the  sinusoidal  scan  signal  and  by  increasing  the  bandwidth  of  its  blanking 
circuitry.  The  modified  display  unit  will  then  be  evaluated  using  the  existing  char- 
;  ter  generator. 
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1.  SUMMARY 

The  design  for  the  experimental  text-access  system  is  essentially  complete 
and  the  system  is  being  assembled.  The  basic  features  of  the  system  are  described 
in  the  preceding  semiannual  report:  additional  details  are  described  below.  The 
system  will  provide  remote  access  to  the  full  text  of  the  approximately  10,000  docu¬ 
ments  being  included  in  the  augmented  catalog.  Full  text  will  be  stored  on  micro¬ 
fiche.  The  text-access  system  will  be  used  in  information-retrieval  experiments 
in  conjunction  with  the  catalog.  Two  types  of  user  terminals  are  included  in  the 
initial  system.  One  terminal  provides  35-mm  film  facsimiles  of  requested  text, 
and  the  other  utilizes  an  erasable,  electronic  storage-tube  display. 

The  camera-and-processor  equipment  for  the  film  terminal  has  been  com¬ 
pleted  and  the  film-cutting  and  transport  mechanisms  have  been  tested  successfully. 
The  logic  circuits  for  controlling  this  terminal  are  currently  undergoing  tests. 

The  modified  Houston/Fearless  Compact  Automatic  Retrieval  Device  (CARD) 
unit,  which  we  shall  use  for  microfiche  storage  and  retrieval,  has  been  received 
and  is  operating  properly  after  correction  of  misalignments  which  occurred  during 
shipping.  The  flying-spot  scanner  is  mounted  on  the  CARD  unit  and  the  integrated 
operation  of  the  scanner  with  the  retrieval  device  will  begin  shortly. 

A  working  model  of  the  video-signal  transmission  subsystem  has  been  com¬ 
pleted  and  tested.  This  equipment  is  now  awaiting  integration  into  the  rest  of  the 
text-access  facility  arid  system  testing  with  the  central-station  logic  circuits.  An 
analysis  of  various  transmission-network  configurations  based  on  loading,  distance, 
frequency  characteristics,  and  so  forth  is  being  formulated. 

The  design  of  the  text-access -system  control  electronics  is  complete  and 
circuit  boards  are  being  fabricated.  The  Tektronix  Type-611  Storage  Display  Unit 
for  the  transient-display  terminal  has  been  received  and  the  circuits  for  controlling 
the  terminal  are  being  assembled. 

It  is  expected  that  initial  operation  of  the  experimental  text-access  system 
will  begin  early  in  the  next  report  period. 
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2. 


SYSTEM  DESIGN 


Figure  C-l  illustrates  the  configuration  of  the  experimental  text-access  sys¬ 
tem.  The  central  station  includes  a  modified  Houston/Fearless  CARD  unit  which 


Fig,  C-1  Experimental  Text-Access  System 


stores  and  retrieves  the  microfiche  containing  the  text  images,  a  flying-spot 
scanner  for  converting  these  images  to  a  video  signal,  the  text-access -system 
control  electronics  (TASC),  and  the  transmitting  electronics.  The  retrieval  de¬ 
vice  can  access  any  microfiche  within  five  seconds  and  can  change  to  any  page  on 
a  given  microfiche  within  one  second.  The  transmitter  combines  the  video  with 
appropriate  synchronizing  pulses  and  coded  pulses  for  addressing  and  controlling 
the  receiving  terminals.  A  page  of  text  is  scanned  only  once  per  request  and  is 
transmitted  over  a  4.5  MHz-bandwidth  transmission  network.  The  receiving 
terminals  decode  the  address  pulses  and  receive  only  the  video  signals  directed 
to  that  terminal. 

Two  types  of  terminals  are  included  in  the  initial  system,  a  film  terminal 
producing  35-mm  film  images  of  the  scanned  text  and  a  transient  display  utilizing 
a  Tektronix-61 1  Storage  Display  Unit.  The  film  terminal  reconstructs  the  text 


image  from  the  video  signal  on  a  high-resolution  CRT  and  records  the  single-scan 
image  on  film.  Transmissions  to  this  terminal  require  approximately  1  sec  per 
page  and  approximately  1  sec  between  pages.  Multiple  pages  are  recorded  on  a 
film  strip  with  a  maximum  of  10  pages  per  strip.  An  automatic  processor  coupled 
to  the  camera  delivers  the  dry,  developed  film  strips  one  minute  after  the  last 
transmission.  This  type  of  terminal  is  the  only  available  type  which  can  achieve 
adequate  resolution  for  storing  text  from  professional -journal-type  documents,  and 
it  provides  a  form  of  remote  hard  copy  foi  off-line  reading. 

The  transient  display  is  co-located  with  the  augmented-catalog  console  and 
provides  a  quick-look  capability  for  scanning  the  text  of  documents  associated  with 
searches  in  the  augmented  catalog. 

The  limited  resolution  and  writing  rate  of  the  storage  tube  necessitate  system- 
parameter  modification  when  this  unit  is  used.  Accordingly,  a  1000-line  scan  and 
a  full-page  writing  time  of  four  seconds  are  used  with  the  storage  tube.  In  an  ef¬ 
fort  to  partially  overcome  the  limited  resolution  of  the  storage  tube,  we  are  pro¬ 
viding  external  circuitry  which  will  allow  the  user  to  request  a  factor-of-two 
enlargement  of  a  quarter-page  centered  on  one-of-nine  selectable  points  of  a  page. 

Requests  for  documents  can  be  placed  either  through  the  augmented-catalog 
console  being  developed  by  1NTREX  or  through  the  teletypewriters  used  with  the 
central  time-shared  computer.  The  augmented-catalog  console  will  contain  a  text- 
access  mode  during  which  the  programmable  buttons  and  keyboard  are  utilized  to 
input  user  actions  associated  with  text-access  requests.  Function  switches  at  the 
text-access  display  will  be  provided  to  flip  microfiche  frames  of  a  document  in 
either  direction. 

Some  effort  has  been  made  in  the  design  of  the  central-station  electronics  to 
minimize  the  load  imposed  by  the  text-access  function  on  the  augmented-catalog 
buffer/controller.  This  minimization  of  load  is  accomplished  by  implementing 
many  of  the  control  routines  in  the  central-station  hardware  rather  than  through 
computer  software. 

3.  STORAGE  AND  RETRIEVAL  DEVICE 

The  Houston/Fearless  CARD  device  was  delivered  in  June  of  this  reporting 
period,  but  was  damaged  in  transit.  The  alignment  of  the  selection  and  retrie/al 
mechanism  was  upset,  and  this  upset  caused  very  poor  reliability  during  the  re¬ 
trieval  cycle.  Retrieval  malfunctions,  in  turn,  resulted  in  damage  to  the  set  of 
microfiche  used  for  testing  the  unit.  Since  realignment,  the  device  has  functioned 
well  and  it  is  expected  that,  upon  procurement  of  a  new  set  of  microfiche,  its  per¬ 
formance  should  be  satisfactory. 

The  CARD  device  is  designed  to  store  up  to  750  microfiche  with  access  times 
of  less  than  five  seconds  to  any  microfiche  and  one  second  to  any  frame  on  that 
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fiche.  The  microfiche  conform  to  the  COSATI  standard  and  contain  five  rows  by 
12  columns  of  page  images.  The  storage  capacity  of  45,000  pages  is  expected  to  be 
adequate  for  most  of  the  10,000  documents  being  selected  for  the  initial  augmented- 
catalog  system,  since  these  documents  are  averaging  approximately  five  pages 
per  document.  The  microfiche  are  stored  in  this  device  in  bins  located  on  a 
carousel  with  20  to  26  microfiche  per  bin.  The  maximum  of  26  fiche  per  bin  pro¬ 
vides  storage  approximately  10  percent  above  the  design  capacity  of  750  microfiche. 

The  modifications  which  have  been  made  to  the  standard  CARD  unit  are  those 
required  to  integrate  the  device  with  a  flying-spot  scanner  and  to  control  the  fiche- 
and  frame-selection  circuits  from  computer-generated  inputs.  The  scanner  is  the 
same  as  that  used  for  the  experimental  image -transmission  system  described  in 
previous  reports.  The  CRT  of  the  scanner  and  the  CARD  unit  are  mounted  on  the 
same  base  to  maintain  rigidly  the  distances  between  the  scanner  CRT,  lens,  and 
the  film  plane.  A  photograph  of  the  modified  CARD  unit  and  scanner  is  shown  in 
Fig.  C-2. 


Fig.  C-2  The  Modified  Microfiche  Siorage  Unit  and  Flying-Spot  Scanner 


The  projection  lamp  used  for  viewing  in  the  standard  CARD  unit  is  replaced 
with  the  photomultiplier  tube  for  the  flying-spot  scanner.  For  focusing  purposes, 
the  scanner  lens  can  be  moved  along  the  optical  axis  by  means  of  an  electric  motor. 
The  electrical  modifications  to  the  standard  unit  permit  the  retrieval  cycle  to  be 
controlled  by  signal  voltages  from  the  central-station  logic  circuits  in  addition  to 
the  standt  *d  CARD  keyboard.  The  interface  between  the  central  station  and  the 
CARD  unit  is  described  in  Section  7. 

4.  TRANSMISSION  SUBSYSTEM 

The  development  effort  during  the  reporting  period  has  resulted  in  a  working 
model  of  the  transmission  subsystem  described  in  the  preceding  Activity  Report. 

Two  important  changes  in  the  system  have  been  made:  a  programmable  line 
counter  had  been  added  to  the  transmitter,  and  a  decision  has  been  made  to  employ 
baseband  transmission  exclusively  in  the  first  experimental  text-access  system. 

A  diagram  of  the  transmission  subsystem  in  its  present  form  is  shown  in 

Fig.  C-3,  where,  for  simplicity,  only  one  receiver  is  shown.  Scanning  parameters, 

such  as  horizontal  period  and  number  of  lines,  are  digitally  programmable  since 

they  vary  from  terminal  to  terminal.  The  programmable  counters  and  crystal 

oscillator  used  to  implement  this  feature  result  in  scanning  parameters  which  are 
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accurate  to  within  one  part  in  10  .  The  digital  signals  will  originate  in  the  Text- 
Access -System  Control  logic  (TASC  logic)  described  in  another  section  of  this  re¬ 
port.  The  address  and  command  signals  mentioned  in  the  last  progress  report  are 
also  generated  by  the  TASC  logic. 

The  line  driver  has  been  designed  and  built  s  a  standard  circuit  which  will 
also  serve  as  a  video  amplifier  in  certain  user  terminals.  This  amplifier  combines 
the  digital  and  analog  signals  and  drives  a  75-ohm  coaxial  cable  with  a  56-volt, 
peak-to-peak,  signal.  The  frequency  characteristics  of  the  cable  can  be  compen¬ 
sated  by  means  of  suitable  feedback  networks  in  the  line  driver. 

The  receiver  analog  circuitry  shown  in  Fig.  C-3  consists  of  a  variable  at¬ 
tenuator,  voltage  follower,  peak  detector,  and  comparator  connected  as  an  automatic- 
gain-control  (AGC)  system,  and  a  matched-filter  and  threshold  comparator  which 
serve  to  separate  the  digital  signals.  The  AGC  system  compensates  for  10-to-i 
variations  in  the  low-frequency  gain  of  the  system  which  might  result  from  temper¬ 
ature  changes  or  changes  in  separation  distances  between  transmitter  and  receiver. 
The  receiver  digital  circuitry  was  discussed  in  the  preceding  Activity  Report. 

A  set  of  rules  for  the  formation  of  a  network  of  receiving  terminals  is  under 
development.  Based  on  loading,  frequency  characteristics,  distance,  and  other 
considerations,  these  rules  will  determine  the  extent  and  form  of  text-access  sys¬ 
tems  which  use  the  standard  components.  It  is  evident  that  the  maximum  distance 
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will  be  determined  by  the  transmission  subsystem  while  the  maximum  number  of 
terminals  will  be  determined  by  limitations  of  the  scanner  and  automatic -retrieval 
mechanism. 

Because  of  the  experimental  nature  of  the  text-access  system,  an  attempf  has 
been  made  to  design  the  transmission  subsystem  flexibly  and  with  an  expansion 
capability.  These  features  are  reflected  in  the  following  facts  about  the  system: 

The  programmable  digital  circuitry  in  the  transmitter 
is  capable  of  generating  a  wide  range  of  sweep  rates 
and  numbers  of  lines . 

Provision  has  been  made  for  six  address  and  seven 
command  bits,  although  only  two  addresses  and 
twelve  commands  are  required  for  the  first  text- 
access  system. 

The  crystal -controlled  blocks  and  the  AGC  circuitry 
minimize  the  number  of  adjustme  .ts  required;  re¬ 
ceiver  terminals  can  be  mov  d  a  will  without  any 
design  changes . 

If  carrier  transmission  is  found  nec  ssary,  the  signal 
format  need  not  be  changed. 

The  choice  of  4.  5  MHz  for  the  basic  clock  frequency 
permits  the  addition  of  a  pilot  carrier  to  eliminate  the 
crystal  oscillators  now  used  in  the  receivers.  This 
pilot  carrier  could  also  be  used  to  provide  a  high- 
frequency-gain  reference  for  adaptive  long-distancc- 
cable  compensation. 

The  system  is  versatile,  yet  not  excessively  complex.  Future  terminals  ca.. 
be  added  as  required  without  modification  of  transmission-subsystem  elements. 

5.  TRANSIENT -PIS PLAY  TERMINAL 

The  design  and  construction  of  the  transient-display  terminal  utilizing  an 
electronic  storage  tube  is  nearing  completion.  The  terminal  will  be  used  in  con¬ 
junction  with  the  augmented -catalog  console  and  will  be  located  adjacent  to  the 
console.  Requests  for  full  text  of  documents  are  initiated  through  the  augmentcd- 
catalog  console  operating  in  the  text-access  mode  by  means  of  the  programmable 
buttons  and  function  switches.  The  programmable -button  label-  assist  the  user  in 
selecting  various  options  during  the  text-request  sequence  such  as  the  choice  of 
document  and  page  number,  choice  of  storage-tube  or  film -terminal  display,  a 
full-page  display  or  a  magnified  display  of  a  page  sector,  and  so  forth. 

In  addition  to  these  input  devices  at  the  catalog  console,  three  lunction 
switches  are  located  at  the  storage-tube  display  and  connected  to  the  console.  Two 
of  these,  page  forward  and  page  backward,  enable  the  user  to  request  the  following 
or  preceding  page  by  pushing  a  single  button.  The  third  function  switch  initiates  the 
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magnify  mode  in  which  approximately  one-quarter  of  the  page  area  in  scanned  and 
transmitted,  thereby  giving  a  factor-of-two  magnification  of  that  page  sector.  In 
this  mode,  an  illuminated  rectangle  appears  as  an  overview  on  the  page  display. 
The  rectangle  outlines  the  page  sector  to  be  magnified  and  can  be  moved  to  any  one 
of  nine  positions  by  means  of  a  pushbutton  matrix  at  the  display. 

Several  indicator  lights  are  included  at  the  storage-tube  display  to  inform 
the  user  of  the  status  of  his  request  such  as  FICHE  NOT  FOUND,  LAST  PAGE  OF 
DOCUMENT,  and  REQUEST  IN  PROCESS.  The  pushbuttons  and  indicators  are 
lighted  and  operated  in  a  programmed  sequence  to  assist  the  infrequent  user  in  his 
requests . 

The  terminal  equipment  consists  of  two  main  components,  the  Tektronix 
Type-611  Storage  Display  Unit  and  the  electronics  for  controlling  the  display  and 
the  user  inputs.  The  terminal  is  designed  to  be  self-sufficient  in  that  it  requires 
no  external  power  supplies.  The  storage  tube  has  provision  for  erase,  and  x,  y, 
and  z  axes  inputs.  A  dual -channel  ramp  generator,  synchronized  to  the  scanner 
by  the  receiver  electronics,  provides  the  x-  and  y-deflection  signals .  The  video 
signal  is  applied  to  the  z  axis  to  unblank  the  tube  and  to  store  the  text  display. 
Since  the  storage  tube  is  a  bistable  device,  it  does  not  have  gray  scale  capability. 

The  rectangle  displayed  during  the  magnify  mode  is  generated  by  appropriate 
deflection  signals  to  the  x  and  y  axes  and  is  refreshed  60  times  per  second  to 
«  liminate  flicker.  The  rectangle  is  not  stored  by  the  display  in  order  that  it  can  be 
n;  ved  to  different  sectors  by  the  user.  The  write-through  mode  for  the  rectangle 
is  achieved  by  applying  a  one -microsecond  pulse  to  unblank  the  z  axis  at  a  re¬ 
petition  rate  of  approximately  20  KHz.  Because  the  dot  writing  time  of  the  tube  is 
approximately  20  microsec,  the  rectangle  is  visible  but  is  not  stored.  Although 
the  storage-tube  resolution  is  not  sufficient  for  high-quality  textual  images,  this 
terminal  should  permit  useful  experiments  with  an  erasable,  stored  display.  The 
rnagmfied-display  capability  should  enhance  the  usefulness  of  the  device,  par¬ 
ticularly  with  documents  containing  small  characters. 

b •  FILM  fERMIV  . ' 

As  rcoorted  in  the  preceding  Semiannual  Activity  Report,  we  have  purchased 
a  Kodak  MCU-II  film  unit  and  a  GAF  Transflu  1206  processor  and  have  merged  the 
two  devices  into  a  camera-processor  unit.  The  camera -proc es sor  unit,  shown  in 
Fig.  C-4.  is  in  the  final  stage  of  adjustment. 

Figure  C-5  shows  the  two  electrically-activated  magnetic -clutches  which 
couple  the  transport  drive-rollers  to  the  camera  motor  during  film  advance  and  to 
the  processor  motor  after  the  film  has  been  cut.  Three  inic  ros  witches  are  located 
in  the  film  guides  to  sense  the  travel  of  the  exposed  nlm  through  the  transport. 


Fig.  C-4  The  Camera  Proceuor 


These  sense  switches  are  connected  with  the  film-terminal  control -logic  in  such  a 
manner  as  to  prevent  successive  film  strips  from  overlapping. 

Figure  C-6  is  a  photograph  of  the  ,ake-up  chamber  of  the  Kodak  film  unit; 
it  shews  the  modifications  which  have  been  made  to  cut  off  automatically  short 
lengths  of  film  and  transport  them  to  the  processor.  At  the  end  of  a  series  of  ex¬ 
posures,  or  after  10  adjacent  frames  have  been  exposed,  the  camera  is  commanded 
to  advance  the  film  until  the  last  exposed  frame  has  cleared  the  cutter.  The  solenoid- 
operated  film-cutter  then  severs  the  exposed  film  which  is  then  moved  through  the 
film  guides  to  the  processor  by  means  of  three  neoprene  drive- rollers . 

Additional  modifications  were  made  to  the  Kodak  film  unit  to  hold  the  receiver 
lens  in  a  precise  position  with  respect  to  the  CRT  and  film.  The  camera  and  CRT 
were  mounted  on  a  one -inch  thick  aluminum  plate  which  was  in  turn  bolted  to  the 
side  of  the  GAF  processor  using  shock-absorbing  mountings. 

The  operation  of  the  camera-processor  is  initiated  by  a  digital  signal  trans¬ 
mitted  from  the  central  station.  The  actual  sequencing  of  the  camera-processor  is 
accomplished  by  the  film-terminal  control  circuitry.  The  logic  for  thiB  circuitry  is 
constructed  from  DTL  and  TTL  dual-in-iine  integrated  circuits.  The  camera 
solenoid,  the  film-cutter  solenoid,  and  the  control  relay  in  the  processor  require 
approximately  five  amps  and  this  current  is  switched  with  silicon  coaiiullc.il  ic.-i.i- 
fiers , 

To  insure  that  textual  images  will  be  transmitted  to  the  film  terminal  only 
when  the  camera  shutter  is  open,  the  camera  is  loaded  with  film,  and  the  CP.T  is 
operative,  a  status  level  wiil  be  transmitted  to  the  central  station. 

In  order  to  conserve  water  and  mechanical  -  ear  on  the  GAF  processor,  a 
unijunction -transistor  timer  has  been  constructed  to  cycle  the  processor  in  the 
following  m inner.  nmediately  after  the  initial  turn-on  of  the  receiver  station, 
the  processor  will  run  for  17  minutes  to  warm  up.  Following  this  initial  warm-up 
period,  the  processor  will  he  sv’itched  to  a  stand-by  condition  until  a  request  to 
process  a  film  strip  has  bean  received.  If  15  minutes  elapses  before  the  next  re¬ 
quest,  the  processor  will  be  run  for  two  minutes  to  maintain  the  solution  temper¬ 
atures,  and  then  return  to  the  stand-by  condition. 

As  reported  in  the  preceding  Semiannual  Activity  Report,  the  success  of  the 
microfilm-facsimile  terminal  depends  upon  the  availability  of  convenient  microfilm 
viewers  for  the  film  strips  that  emerge  from  the  camera-processor  unit.  A 
prototype  of  a  potentially-inexpens ive ,  plastic  film-strip  holder  has  been  con¬ 
structed  and  a  device  that  will  accept  this  holder  is  being  installed  in  an  IBM-9922 
document  viewer. 
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7. 


TEXT-ACCESS  SYSTEM  CONTROL 


The  text-acce<=  a  system  control  (TASC)  electronics  provides  the  interface 
with  the  buffer/controller,  the  CARD  unit,  and  the  transmission  subsystem.  It 
receives  messages  of  one,  two.  or  three  !  bit  words  from  the  buffer/controller 
in  the  format  shown  in  Fig.  C-7.  Upon  receiving  a  user  request  through  the 
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Fig.  C-7  Input-Metsage  Format 


augmented -catalog  console,  the  buffer/controller  formats  the  message  and  sends 
only  the  words  necessary  to  process  the  request.  The  first  word  is  always  required 
because  it  contains  the  user-terminal  address  to  which  the  requested  text  will  be 
transmitted.  The  second  word  is  sent  whenever  the  request  requires  a  retrieval 
cycle  in  the  CARD  unit  or  when  digital  commands  are  transmitted  to  a  user  termi¬ 
nal.  The  third  word  is  sent  when  the  request  is  for  a  multiple  number  of  frames 
at  the  film  terminal  or  for  a  magnified  display  at  the  tran;  ient-display  terminal. 
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Fig.  C-8  TASC  Functional  Diagram 


The  message  forma.',  includes  spare  bits  for  additional  functions  that  might  be  added 
to  the  system  later.  The  command  bits  in  the  second  word  are  intended  for  trans¬ 
mitting  digitally  coded  information  to  the  central  station  and  the  user  terminals. 

Communication  between  the  buffer/controller  and  the  TASC  circuitry  will  be 
over  a  300  bit-per-second  serial  data  channel.  Each  16-bit  word  will  be  preceded 
by  an  initial  logical  ZERO  and  followed  by  a  logical  ONE.  The  TASC  electronics 
will  contain  three  16-bit  shift  and  storage  registers  corresponding  to  word  one, 
word  Two,  and  word  three,  and  data  from  the  buffer  controller  will  be  entered  into 
the  proper  TASC  shift  register.  After  the  completion  of  a  request,  or  if  a  minor 
malfunction  occurs,  the  registers  will  be  cleared  and  a  status  message  will  be  sent 
back  to  the  buffer/controller  . 

A  block  diagram  of  the  TASC  electronics  appears  in  Fig.  C-8.  The  sequence 
controller  is  a  modulo-  °  counter  which  ic  indexed  by  signals  indicting  the  completion 
of  each  step  in  the  microfiche  retrieval,  frame  positioning,  and  scanning  cycles. 

This  concept  was  chosen  because  it  allows  independent  design  of  the  logic  for  each 
vep;  independent  design,  in  turn,  simplifies  not  only  the  initial  circuitry  but  also 
future  modifications  to  the  TASC  electronics.  Industrial-grade,  dual-in-line  inte¬ 
grated  circuits  and  medium-scale-integration  devices  have  been  selected  to  imple¬ 
ment  the  logic  design.  These  components  are  mounted  on  double-sided,  copper- 
clad,  4x6  inch  circuit  boards,  as  snown  in  Fig.  C-9.  The  circuit  boards  are 
mounted  in  a  standard  19-inch  rack  which  is  assembled  into  a  cabinet,  (Fig.  C-10). 

When  the  shift  registers  have  received  a  complete  message,  a  signal  is  gen¬ 
erated  which  sets  the  sequence  controller  to  step  1.  If  word  two  was  included  in 
the  message,  the  step-1  logic  will  decode  the  command  bits.  Legal  commands  con¬ 
sist  of  the  binary  coded  numbers  0  through  7.  It  is  planned  that  these  commands 
will  be  utilized  to  switch  into  a  character-writing  mode  where  digital  characters 
may  be  sent  to  the  film  terminal  to  identify  the  f  a  strips  with  notations  readable 
by  the  unaided  eye.  If  there  is  no  second  word  or  if  the  command  0  is  sent  then 
the  sequence  controller  will  skip  to  step  2. 

The  microfiche-retrieval  cycle  is  accomplished  during  step  2.  At  the  end  of 
each  scanning  cycle,  the  microfiche  remains  in  the  CARD  positioning  device  until 
the  next  request;  a  retrieval  cycle  is  necessary  only  if  the  request  involves  a  dif¬ 
ferent  fiche.  If  the  request  is  for  a  frame  on  the  same  fiche  as  the  previous  trans¬ 
mission,  the  second  word  is  not  included  in  the  message  and  the  sequence  is  in¬ 
dexed  through  this  step.  If  a  new  fiche  must  be  retrieved,  the  b\tffer/controller 
includes  the  second  word  in  its  message.  The  fiche  number  of  three  digits  in  binary- 
coded  decimal  form  is  forwarded  fr<  n  the  shift  register  directly  to  the  CARD  unit. 

At  step  2  of  the  sequence,  a  retrieve  signal  to  the  CARD  unit  initiates  the  fiche- 
selection  cycle.  Feedback  signals  from  the  CAR  )  unit  indicate  the  end  of  the  cycle 
and  index  the  sequencer  to  the  next  step. 


In  step  3,  the  x-y  positioning  servo  of  the  CARD  unit  positions  the  requested 
frame  in  front  of  ih>-  scanner  aperture.  The  binary-coded  row  and  column  iruormation 
in  the  input  message  is  decoded  and  activates  onc-of-five  row  inputs  and  one -of -twelve 
column  inputs  to  the  CARD  unit.  When  the  position  servos  reach  the  commanded  posi¬ 
tions,  a  pulse  is  fed  back  to  index  the  sequence  controller  to  the  next  step. 

The  tolerances  for  positioning  the  page  images  on  the  fiche  during  filming  and 
f iche -as sembling  operations  are  such  that  the  page  locations  can  vary  considerably  with 
respect  to  the  COSATI  grid,  particularly  in  the  horizontal  direction  of  the  text.  To 
center  each  page  on  the  optical  axis  of  the  scanner,  a  technique  for  locating  the  left 
edge  of  the  page  image  before  scanning  will  be  included  in  the  positioning  cycle.  The 
technique  is  to  use  the  frame  positioning  of  step  3  for  approximate  positioning,  and  then 
in  step  4  to  scan  the  flying-spot  scanner  along  a  vertical  line  just  outside  the  normal 
raster.  The  fiche  is  moved  ia  a  horizontal  direction  by  the  CARD  servo  until  the  edge 
of  the  page  image  is  detected  from  the  video  signal.  All  filmed  page  images  are  sepa¬ 
rated  by  a  clear  gap  and  each  has  a  dark  oorder  surrounding  th  text.  The  effect  on 
the  video  signal  of  the  transition  from  the  gap  to  the  border  is  used  to  detect  the  align¬ 
ment  of  the  page  edge  with  the  scanned  line.  The  initial  system  will  utilize  the  edge¬ 
finding  technique  only  in  the  horizontal  direction  because  it  is  believed  that  the  vertical 
positioning  of  the  pages  on  the  fiche  is  sufficiently  accurate  to  make  a  centering  cycle 
in  that  direction  unnecessary.  When  the  edge  is  detected,  the  positioning  servos  are 
disabled  and  the  sequence  is  indexed  to  the  next  step.  The  edge  finder  will  be  imple¬ 
mented  during  the  next  report  period. 

Step  5  is  used  only  when  a  magnified  display  is  requested  at  the  t  cans  lent -display 
terminal.  As  previously  described,  the  user  can  select  a  ZX  magnification  of  a 
quarter-page  centered  on  one  of  nine  points.  During  this  step,  the  x-  and  y-seetor 
bits  in  word  3  cause  signals  to  be  sent  to  the  CARD  servos  which  offset  the  fiche  with 
respect  to  the  scanner  axis.  During  transmission,  the  scanner  raster  is  reduced  to 
one-half  its  horizontal  and  vertical  dimensions  which  causes  the  characters  in  the 
reconstructed  image  at  the  display  terminal  to  be  twice  their  normal  size.  By  shifting 
the  fiche  instead  of  the  raster,  the  scanner  is  used  as  close  to  its  optical  axis  as 
possible,  thereby  maximizing  its  resolution.  A  pulse  at  the  end  of  the  fiche -in -motion 
signal  is  used  to  indicate  that  the  fiche  is  positioned  at  the  requested  sectc  r  and  the 
sequence  controller  is  indexed  to  the  next  step. 

If  the  t  rans  lent -dis  play  terminal  is  addressed,  a  command  begin  vertical 
sweep  is  given  to  the  transmitter  logic  when  the  sequence  controller  activates  step  6. 
When  the  film  terminal  is  addressed,  the  T ASC  logic  monitors  the  film -terminal 
status  line  and  will  send  images  to  the  film  terminal  only  if  the  status  line  is  high. 

When  the  end  ol  the  vertical  scan  is  received,  the  sequence  controller  will  index  to 
ste  p  7  . 

If  a  user  requests  several  pages  from  a  document  to  be  transmitted  to  the  film 
terminal,  the  buffer/controller  includes  the  number  of  pages  in  word  3  of  the  message, 


and  this  number  is  loaded  into  a  frame  counter  during  st£p  1.  At  step  6,  the  counter 
is  indexed  down  by  one  when  a  frame  is  transmitted.  At  3tep  7,  ^the  frame  counter  is 
checked  to  see  if  there  are  any  remaining  pages  of  the  document  to  be  transmitted, 
and  if  so,  the  sequence  controller  resets  the  sequence  to  step  3.  The  registers  con¬ 
taining  the  frame  row  and  column  are  indexed  to  move  to  the  next  page  and  the  sequence 
continues  from  step  3.  When  the  frame  counter  reaches  aero  at  step  7,  the  last  page 
of  the  request  has  been  transmitted  and  a  CUT -AND -PROCESS -FILM  command  is  sent 
to  the  terminal  and  t'  e  sequence  counter  is  indexed  to  ste^s  8. 

Upon  entering  step  8  the  sequence  counter  is  immediately  reset  to  step  0,  the 
resting  state,  and  a  REQUEST -COM PLETED  message  is  transmitted  to  the  computer. 

In  order  to  prevent  the  sequence  controller  from  stopping  due  to  an  illegal  mes¬ 
sage  or  minor  equipment  malfunction,  a  unijunction  timer  is  started  at  the  initiation  of 
each  step.  11  more  than  18  seconds  elapses  without  indexing  from  one  step  to  another, 
the  unijunction  timer  will  trip, reset  the  sequence  counter.and  transmit  an  error  mes¬ 
sage  to  the  computer.  |  f  > 

8  .  TEXT -ACC ESS  SOFTWARE 

The  text-access  hardware  system,  described  in  the  preceding  sections,  receives 
its  input  signals  from  the  buffer/controller  of  the  INTREX -catalog  console.  These 
input  signals  are  generated  within  the  buffer/controller  by  the  6201  computer  in 
response  to  uSer  requests.  As  a  consequence,  certain  software  requirements  are 
imposed  on  the  6201  in  order  that  it  may  select  and  format  the  command  signals  prop¬ 
erly.  These  software  features  are  described  below. 

Figure  C-ll  shows  the  buffer/controller  in  relation  to  the  rest  of  the  INTREX 
system.  The  6201  serves  as  a  functional  interface  betwJen  the  user  and  the  text-access 
controller.  A  set  of  function  switches  located  at  either  the  display  station  or  at  the 
catalog  >nsole  may  be  used  as  input  devices  to  the  text-access  system.  The  6201 
formats  n-  proper  command  sequences  and  issues  them  to  the  text-access  controller. 
The  s  ys  in  is  designed  for  multiple  users  and  the  6201  maintains  the  queueing  algo¬ 
rithms  for  text-access  requests. 

Much  attention  has  been  given  to  the  problem  of  communication  between  the  6201 

computer  and  the  text-access  controller.  Flexible  operation  is  desirable  with  a  mini- 
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mum  of  information  transmission  between  devices.  The  following  discussion  of  the 
control-and-communication  features  is  given  from  the  viewpoint, of  the  6201  program¬ 
mer  who  regards  the  text-access  controller  as  another  I/O  device. 

The  6201  communicates  with  the  text-access  controller  through  a  16-bit  word 
buffer  attached  to  the  I/O  bus  of  the  computer,  as  shown  in  Fig.  C-12.  Under  program 
control  16-bit  computer  words  can  be  transferred  to,  or  from,  the  text-access  control¬ 
ler  and  the  internal  registers  and  core  storage  of  the  6201  computer.  Words  sent  to 


Fig.  C-12  Interface  of  Text-Acce*»  Controller  with  6201  Conifer 


the  text-access  controller  serve  as  commands  to  the  film  and  display  stations.  The 
word  elements  of  the  text-access  commands  are: 

Terminal  Address:  A  6-bit  device  address  indicating  the  device  to  which  the  command 
sequence  is  directed.  The  film  station  is  assigned  Device  Address  1  and  the  display 
station,  Device  Address  2.  There  is  addressing  capability  for  63  devices. j 
Fiche  and  Frame  Numbers:  The  retrieval  device  requires  a  fiche  number  in  order  to 
select  a  document  from  the  store  and  a  frame  number  in  order  to  position  a  page  for 
either  viewing  or  film  reproduction. 

F iche  Humber  :  A  12-bit  number  consisting  of  3  BCD  digits,  each  containing 
four  bits.  The  fiche  numbers  may  range  from  000  to  999  although  zero  does 
not  correspond  to  a  stored  fiche  and  '.he  retrieval  device  may  not  have  a  capac¬ 
ity  of  999.  The  fiche  number  corresponds  to  the  unit  record  of  the  retrieval 
device.  That  record  consists  of  60  separate  frames.  Each  frame  normally 
corresponds  to  a  single  page  of  the  fill  led  document.  j  j 

Frame  Number:  A  7 -bit  number  consisting  of  a  3 -bit-row  number  and  a  4- 
bit-column  number.  This  arrangement  is  shown  schematically  in  Fig.  C-13. 
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Frome  position  on  Fiche 


Fig.  C-13  Frame  Number  and  Corresponding  Fiche  Posi^on 

The  fiche  contains  60  frames  arranged  in  5  rows  and  12  columns.  The 
(1,  1)  position  would  be  referenced  by  the  binary  string  001  0001;  the 
(5,  12)  position  by  101  1100.  This  code  was  imposed  by  the  retrieval- 
device  design. 

Number  of  Frames:  A  4 -bit  number  which  indicates  the  number  of  frames  to  be 
placed  on  one  strip  of  the  reproduced  film  generated  by  the  film  station  when  a  copy  is 
requested.  Currently  hardware  requirements  set  a  maximum  of  10  frames  per  strip. 

If  the  document  exceeds  10  pages,  6201  Program-Control  issues  the  number  of  film 

t  r 

requests  needed  to  copy  the  entire  document.  Program-Control  will  also  set  a  limit 

V 

on  the  number  of  pages  a  user  may  request  at  a  given  time.  ; 
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Strip  Continue:  A  1 -bit  indicator  for  the  film  station.  If  the  bit  is  set  to  one,  the  film 

i 

station  does  not  cut  the  film  after  completing  the  current  copy  request.  The  user  is 

•  s  f, 

thus  allowed  to  place  more  than  one  dociiment  on  the  same  film  strip.  The  usual  situa¬ 
tion  will  be  to  cut  the  film  strip  after  a  given  document  has  been  reproduced,  and  so 
the  Strip  Continue  bit  would  be  zero. 

i 

Magnify:  A  I -bit  indicator  requesting  that  a  displayed  image  be  magnified. 

X  and  Y  Sector  Numbers:  Two  3 -bit  numbers  represented  in  a  oneVout-of -three  code 
used  to  give  the  X,  Y  coordinates  of  the  portion  of  the  page  (called  the  sector)  to  be 
magnified.  Figure  C-14  presents  the  coordinate  representation.  The  one-out-of-three 
code  is  selected  for  convenience  of  decoding  at  the  text-access  controller. 
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Fig.  C-14  X,Y  Coordinate  Representation  of  Poge 
Sector  for  Magnification  Option 


Command :  A  4-bit  command  code  used  for  issuing  commands  either  to  the  text-access 
controller  or  to  the  6201  buffer/controller .  These  special  codes  are  included  to  allow 
flexible  operation  and  system  modification  as  the  Intrex  experiments  'proceed.  They 
will  also  be  used  to  convey  status  information  from  the  text-access  controller  to  the 
6201. 

Word -Sequence  Indicator:  A  2-bit  code  used  to  indicate  to  the  text-access  controller 
which  of  four  different  command-word  sequences  is  currently  being  issued  by  tlie  6201. 
A  detailed  discussion  of  this  feature  follows  in  the  next  paragraphs. 

COMMAND-WORD  SEQUENCE 


f 


The  action  of  the  text-access  controller  is  directed  by  the  6201  which  formats 
the  above  data  elements  into  command  words  and  outputs  those  command  words  via 
the  I/O  bus  in  a  sequence  determined  by  the  system  state  at  the  time  the  request  is 
processed.  There  are  three  command-word  formats. 

The  firjt.  word  contains  the  Text-Acces  s -Device  Address,  the  Frame  Number, 
the  Strip-Continue  bit,  and  the  Word-Sequence  Indicator.  Designated  by  the  word 
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formats  WORD1,  WORD2  and  WORD3,  the  Word-Sequence  Indicator  has  the  following 

code : 

00  Only  WORD  l  will  be  sent  for  this  command.  The  word  sequence 
indicator  itself  is  contained  in  WORD1. 

01  WORD1  and  WORD2  will  be  issued  in  that  sequence  for  the  cur¬ 
rent  command. 

10  WORD!  and  WORD3  will  be  issued  in  that  sequence  for  the  cur¬ 
rent  command. 

11  WORD1.  WORD2  and  WORD3  will  be  issued  in  that  sequence  for 
the  current  command. 

All  commands  require  a  WORD1. 

The  second  word  contains  the  3-digit  BCD  fiche  number  and  the  4-bit  command 
code.  The  third  word  contains  either  the  4-bit  number- oi-pages  count  or  the  X  and 
Y  sector  numbers  with  the  magnify  bit  set  to  one.  Figure  C-7  summarizes  the 
command  -  word -format  information. 

DISPLAY-COMMAND  SEQUENCE 

To  command  the  text-access  system  to  retrieve  a  specific  page  of  a  stored  docu¬ 
ment  and  to  display  it  at  the  storage-tube  station,  the  6201  normally  formats  and  outputs 
a  2-word  sequence  to  the  text-access  controller.  The  first  word  contains  the  terminal 
address  of  the  display  station,  the  frame  number  (row  and  column)  of  the  page  to  be 
displayed  and  a  word-sequence  indicator  of  01.  The  strip  continue  indicator  is  not 
significant  m  this  example  and  is  ignored  by  the  control’  -r.  The  second  word  contains 
the  3-digit  fiche  number  and  the  command  code  is  zero.  (A  zero  code  is  used  for  all 
display  and  copy  commands.  The  terminal  address  determines  which  command  is  dis¬ 
play  and  which  is  copy.  )  The  text-access  controller  causes  the  retrieval  device  to 
select  the  requested  fiche  and  to  position  the  proper  frame  (page)  for  display  on  the 
storage-tube-display  device. 

The  retrieval  device  does  not  re  store  the  selected  fiche  until  a  new  request  is 
made.  If  the  next  request  is  for  another  frame  on  the  same  fiche  (as  is  often  the  case 
for  -l  user  viewing  successive  pages  of  a  single  document),  then  the  6201  sends  only 
WORD1  for  each  subsequent  request  for  the  same  document.  Currently,  a  single  docu¬ 
ment  is  completely  contained  on  one  fiche.  Such  subsequent  requests  require  a  new 
frame  number  and  the  word-sequence  indicator  set  to  00  to  inform  the  text-access 
controller  that  only  one  command  v  ord  is  being  sent  for  that  request. 

MAGNIFIED -DISPLAY  OPTION 

If  the  user  has  selected  a  page  and  has  it  displayed  on  the  storage  tube  he  may 
decide  to  have  the  disp'  station  present  an  enlarged  display  of  part  of  that  page.  The 
6201  then  formats  and  issues  a  WORD1,  WORD3  sequence  to  the  text-access  controller. 
The  only  change  in  WORD  1  from  the  previous  request  is  that  the  word-sequence 
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indicator  sets  to  tO  in  order  to  flag  a  WOR,  i,  WORD3  sequence^  The  terminal -and  - 
frame  information  is  unchanged;  however,  they  must  be  re-issued. 

WORD3  will  contain  the  X-  and  Y-sector  numbers  and  the  magnify  bit  set  to 
one.  The  number-of-frames  part  of  the  word  is  ignored  by  the  controller. 

ALTERATION  OF  COMMAND  SEQUENCE  -  MULTIPLE  USER 

The  text-access  system  is  designed  for  multiple-user  operation.  Often,  a 
second  user  may  request  a  display  or  film  copy  of  a  document  before  user  number  1 
has  finished  paging  through  his  currently-selected  document  or  magnifying  his  cur¬ 
rently  displayed  page.  Since  the  retrieval  device  itself  can  only  have  one  fiche 
selected  at  any  given  time,  the  first  fiche  must  be  restored  before  the  second  fiche 
can  be  retrieved.  The  first  user  still  has  his  image  to  view  and  he  can  still  request 
the  next  page  or  the  magnification  of  his  current  page;  however,  this  request  cannot 
be  honored  until  the  second  user's  fiche  has  been  retrieved,  processed  (filmed  or 
displayed),  and  restored.  Then  the  first  fiche  can  be  re-retrieved  by  the  6201  through 
issuance  of  a  2-word  sequence,  since  User  l's  WORD2  was  lost  in  the  processing  of 
User  2's  request.  For  User  1  to  employ  the  magnify  option,  the  complete  3-word 
sequence  is  needed  because  of  the  intervening  User-2  request.  The  6201  program 
control  determines  which  word  sequence  is  required  under  the  existing  circumstances. 

FILM-RE  PRODUCTION -COMMAND  SEQUENCE 

To  request  a  film  copy  of  a  document  without  having  it  displayed,  a  3-word 
sequence  is  issued  by  the  6201  computer.  The  only  significant  information  in  WORD3 
is  the  number  of  frames  (pages)  to  be  filmed.  Normally,  this  is  the  same  as  the 
number  of  pages  in  the  document  if  its  length  is  ten  pages  or  less.  WORD1  contains 
the  termina'  address  of  the  film  station  which  informs  the  text-access  controller 
that  the  request  is  for  filming  and  not  for  display.  The  Strip-Continue  bit  will  nor¬ 
mally  be  zero  unless  the  user  wants  to  place  more  than  one  document  on  the  same 
film  strip. 

The  more  prcbable  case  will  be  that  the  user  has  viewed  a  document  and  then 
wishes  to  have  it  copied  by  the  film  station.  If  that  fiche  has  remained  selected 
because  no  other  user  has  made  an  intervening  request,  then  the  6201  may  issue  only 
a  WORD!,  WORD3  sequence  for  filming.  No  WORD2  is  needed.  As  the  text-access 
controller  is  currently  designed,  the  computer  program  must  monitor  the  retrieval- 
st 'tion  status  and  skip  issuing  WORD2  if  the  pro,  r  fiche  is  already  selected.  If  that 
function  is  not  performed  by  the  6201,  then  the  fiche  will  be  restored  and  reselected, 
thereby  requiring  a  complete  access  cycle  _  the  retrieval  device. 

MULTIPLE -USER  PRIORITY 

Multiple -user  operation  may  cause  queues  to  develop  for  both  the  display  and 
filming  functions.  Filming  a  multiple -page  document  engages  the  retrieval  device 


for  more  time  than  the  processing  of  a  display  request.  From  a  user -convenienc e 
viewpoint,  waiting  for  a  page  display  would  seem  more  frustrating  than  delaying  the 
availability  of  a  copied  document.  Hopefuity,  the  copy  will  be  processed  by  the  tim 
the  user  reaches  the  film-station  location.  Given  these  considerations  a  higher 
priority  has  been  assigned  to  display  requests  than  to  film  requests,  and,  therefore 
the  6201  scheduling  algorithm  will  favor  the  display  queue  over  the  copy  queue. 
Experiments  with  the  system  may  indicate  other  requirements  on  the  scheduling 
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